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ABSTRACT 


This thesis considers the problem of designing an optimal linear 
time-invariant dynamic compensator for the regulation of an n-th order 
linear time-invariant plant with m independent outputs. The initial 
plant state is characterized by its first and second moments, and the 
cost is the usual quadratic infinite-time penalty on the state and control, 
averaged over the initial plant and compensator states. 

The compensator is based on a minimal -order Luenberger observer and 
consequently has fixed dimension n-m. Necessary and sufficient conditions 
are derived for optimality of the compensator gains. The optimal compen- 
sator is shown to be unique if the plant has a particular canonical form, 
and, in general, for any arbitrary plant, the class of all optimal compen- 
sators is precisely determined. 

The main contributions of this thesis are to show that for this 
problem an equivalent of the ’'separation property" holds, and that the 
compensator gains are uniquely determined in terms of a unique design 
matrix, which, in particular, completely specifies the dynamics of the 
minimal-order Luenberger observer. In addition, several examples are 
worked out, and generalizations which treat any sort of initial plant 
and compensator states are provided. 


Thesis Supervisor: Timothy L. Johnson 

Title: Assistant Professor of Electrical Engineering 



ACKNOWLEDGMENTS 


It is a great pleasure to thank here Professor Timothy L. Johnson 
for his important contribution in the author’s education in control. 
His encouragements and patience, his technical insight and suggestions 
have played a fundamental role in making this research a rewarding 
experience. 

I am also indebted to Professor Leonard A. Gould, my Advisor. 


Many interesting discussions with colleague graduate students were 
of great help at various stages during this research. Loren K. Platzman, 
Wolf Kohn, and Pooi Kam deserve special mention. 

I would also like to thank Anne Clee of the Aeronautics Department 
for her fantastic job of typing this thesis, as well as Arthur Giordani 
and Norman Darling of the ESL Drafting Department for their help with 
the figures. 

This research was conducted at the Decision and Control Sciences 
Group of the M.I.T. Electronic Systems Laboratory, with support provided 
by the National Aeronautics Space Administration under Grant NGL-22-009-124 . 



4 


CHAPTER 

CHAPTER 

CHAPTER 


CHAPTER 


CHAPTER 


CHAPTER 


TABLE OF CONTENTS 


I INTRODUCTION 

II PRELIMINARY BACKGROUND 

2.1 Optimal Control and Similarity Transformations 

2.2 Observer Theory: Review of Results 

III DESIGN PHILOSOPHY: PRESENTATION OF THE APPROACH 

3.1 Introduction 

3.2 Formulation of the Problem 

3.3 Reformulation of the Problem 

IV THE OPTIMAL COMPENSATOR 

4.1 Statement of the Result 

4.2 Proof of the Result. Z Non-Singular 

4.3 Generalizations - Non-Singular 

4.4 An Example. E x Non-Singular 

4.5 Proof of the Result. Z x Identically Zero 

4.6 Proof of the Result. Z is Singular 

V THE CLASS OF OPTIMAL COMPENSATORS 

5.1 Introduction 

5.2 Equality of the Total Costs 

5.3 Optimal Compensator. Initial Plant 

5.4 The Class of Optimal Compensators for a Given Plant 

5.5 An Example 

VI CONCLUSION 


6 

12 

12 

18 

24 

24 

27 

32 

39 

39 

43 

50 

54 

61 

67 

78 

78 

80 

84 

88 

100 

108 


REFERENCES 


112 



LIST OF FIGURES 


5 


Pa 9 e 

1. Identity Observer 23 

2. Minimal -Order Luenberger Observer 23 

3 . Internal Structure of the Compensator 31 

4. The "Compact" compensator 31 

5. Percentage of Increase in the Cost (z q = 0) 60 

6. Percentage of Increase in the Cost (z = /T) 6° 

7. Geometric Interpretation of Aj(H^, H 2 >, Singular. 77 



6 


CHAPTER I 
INTRODUCTION 


In the past two decades,- a number of time domain methods have been 
proposed to regulate linear plants. The so-called ’’linear regulator prob- 
lem" has been extensively studied and the work of Kalman, [14], [15], [16], 
[17], must be mentioned here. Recall, however, that for this linear regula- 
tor problem, the optimal control requires the entire state to be generated. 

Cn the contrary, in most applications, one does not have a direct access to 
the complete state, but rather to part of it, through the output vector. 

In [12], [18], Levine solves the problem of finding the optimal output 
feedback controller, assuming that the plant is output stabilizable . Now, 
there exist cases, and we will see an example in Section 414, where constant 
output feedback gains cannot stabilize the plant (i.e., the plant is not 
output stabilizable ). In these cases, a dynamic compensation is required. 
Several methods have then been proposed for designing dynamic compensators 
to regulate n-th order linear plants with only m independent outputs. 

In a stochastic context, a Kalman-Bucy filter [19] of dimension n is 
used to give an estimate of the entire state from noisy measurements, and 
then the control is generated with this estimate via the optimal gains we 
would have for the noise-free linear regulator problem [20]; this is the well 
known separation theorem [21] for the linear quadratic Gaussian problem. 

In case the measurements are noise-free, but the plant is still driven by 
a noisy process, a Bryson -Johan sen filter [22] of dimensions n-m can be 
designed; the separation property holds in this case also, and Bucy [23] 
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proved the optimality of this design* 

Strangely enough , the state of affairs in a completely deterministic 
case is not so clearly defined. Mimicking what has been done in the 
stochastic case, a minimal-order Luenberger observer of dimension n-m, [2], 
[ 24 ], [ 25 ], is very often used to generate an estimate of the state from 
the available outputs, and then a control is generated with this estimate 
via the optimal gains for the linear regulator problem. Recall that if we 
know perfectly the initial state of the plant, a Luenberger observer can 
be designed, having arbitrary dynamics, that reconstructs perfectly the com- 
plete state; recall also that if we don’t know the initial plant state, then 
the error between the true state and the estimate of the state can be never- 
theless be made to decay arbitrarily fast. The above design philosophy is 
then certainly a reasonable one. It can also be shown that in this case, 
the total cost is composed of two parts: the optimal cost for the linear 
regulator problem, (i.e., the optimal cost we would have if we had complete 
state feedback) , and in addition, another term due to the fact that we have 
to reconstruct part of the state to implement the control. The increase in 
the cost incurred by the use of a dynamic compensator can then be precisely 
computed . 

The above design philosophy does have an important practical drawback: 
there is no way to "optimally" determine the dynamics of the observer part 
of the compensator. It was then suggested that an arbitrarily fast observer 
would make the incurred increase in the cost arbitrarily small. However, 
it was soon demonstrated by counter-examples [8], [ 9 ], [ 10 ], that this was 
obviously wrong. The designer, then, is left with his "engineering judgment" 
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and very often, a common design will be one with an observer slightly faster 
than the plant, to insure good tracking properties- The conclusion is that 
if we retain this completely deterministic formulation, there is no general 
optimal solution for the compensator dynamics- 

Notice now that very often we don't know the exact initial state of the 
plant. The idea (introduced in [ 12 ]) is then to design a compensator which 
will be optimal over a set of possible initial states for the plant. This 
leads to a design which is optimal in an "average sense." The fundamental 
results using this approach are given in [ 26 ]. In this paper, a new quad- 
ratic cost is introduced which weighs quadratically both the input signals 
to the plant and the input signals to the compensator. The optimal compen- 
sator gains are then found to be the solutions of coupled Riccati and 
Lyapunov-type equations. The main contribution of [ 26 ] comes from the fact 
that the approach taken does not require the compensator to have the dimension 
of the minimal order Luenberger observer (i.e., n-m) . In particular, an 
optimal reduced order compensator of fixed dimension s (s < n-m) can be 
found with this approach (if one exists) . A slight drawback of this method 
is that it provides no insight on the internal structure of the optimal 
compensator. 

Following this "average- sense" optimal philosophy, what we will do in 
this thesis is to reconsider the problem of using a minimal-order Luenberger 
observer to estimate the state of the plant, and then apply a control gene- 
rated with this estimate via the optimal gains for the linear regulator 
problem. However, we will look for a design which is optimal over some 
set of initial plant states. More precisely, we will consider the initial 
plant state as a random vector specified by its first and second moments. 
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As we will mainly consider time- invariant plants, the cost used will be the 
usual quadratic infinite time penalty on the state and control averaged 
over initial plant and compensator states. 

We note that this approach has already been taken by some authors. 
Newmann in [5], [6] took it, but was only able to derive a set of necessary 
conditions for optimality of the compensator gains. However, these condi- 
tions consisted of a large set of non-linear coupled equations of the 
Riccati-type, and as a consequence were not readily solved, so that no 
explicit structure for the optimal compensator was given. More recently, 

Rom and Sarachik in [7] have given a partially satisfying answer to the prob- 
lem. They actually identified the problem we are trying to solve with a 
steady-state stochastic control problem whose solution is known and requires 
the use of a Bryson-Johansen filter. Imposing some particular conditions 
on the plant and compensator initial states for their comparison to be valid, 
they actually showed by this indirect method one of the main results of this 
thesis, namely that if the plant has a particular canonical form, there 
exists a unique optimal compensator, and that an equivalent of the separation 
property holds for this problem. 

We will completely solve the problem in this thesis using a direct 
approach. Namely, we will find the optimal compensator leading to the 
smallest increase in the cost. Our results are more general than those of 
[7] because the proposed method can treat any sort of plant and compensator 
initial states, because a final answer is also given in the case where the 
initial output covariance matrix of the plant is singular, and most of all 


because the class of all optimal compensators for an arbitrary plant is 
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precisely determined. 

We will give now a brief survey of the different chapters. 

In Chapter II, the linear regulator problem is revisited, and the 
effect of a similarity transformation on the plant states is considered. 

Since the main part of this thesis will consider a particular canonical 
form for the plant, it is important to know how to go from an arbitrary 
plant to this canonical plant, and to know what the optimal design for 
this new plant becomes. Also a brief review of observer theory is given. 

In Chapter III, the design philosophy we have taken is exposed. One 
of the most important original parts of this chapter is the derivation of 
the initial error covariance matrix for the observer. This is truly a 
very important step towards a concise formulation of the problem. Another 
important part (although not original) is the computation of the total cost, 
which is shown to be composed of the two parts we previously mentioned. 
Particularly, the increase in the cost due to the need to reconstruct part 
of the state is clearly expressed- The problem is simply now to minimize 
this increase in the cost. 

In Chapter IV, the main result is stated and proved. The technique 
for the proof involves gradient matrices. The optimal compensator is 
shown to be unique, due to the canonical form we have taken for the plant. 
Different cases are considered, depending on the non -singularity or singular- 
ity of the initial output covariance matrix. Numerous examples are pro- 
vided as well as generalizations to treat different possible plant and 
compensator initial states. 

In Chapter V, we return to the original arbitrary plant. Namely, we 
precisely determine the class of all compensators which are optimal for 
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the given plant (all of them leading to the same increase in the cost, 
and the same dynamics for the overall system plant + compensator). 

In Chapter VI, a possible extension of our results towards a compen- 
sator of lower order s (s < n - m) is presented. 
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CHAPTER II 

PRELIMINARY BACKGROUND 


2 . 1 Optimal Control and Similarity Transformations 

In this paragraph we shall derive some results about similarity 
transformations of the linear t ime -invariant , completely controllable and 
completely observable system: 

x ( t ) = Ax(t) + Bu(t); y(t) = CxCt); x(0) = (2.1.1) 

where x (an n-vector) is the state of the plant, 
u (an r-vector) is the control vector, 
y (an m-vector) is the output of the plant. 

The matrices A , B, c have respective dimensions (nxn), (nxr), (mxn) , 
and we will assume that C has full rank, (i.e., Rank C = m) . 

Assume the initial state of the plant is distributed according to: 


E {x } - 0; E {x x 1 } - £ 
o o o o 


( 2 . 1 . 2 ) 


where ^ is a positive semi-definite matrix. 

Due to the stochastic initial state, consider the following cost: 
00 


J 


« E 


( 


{x' (t) Q x(t) + u 1 (t) R u (t) } dt 


(2.1.3) 


where Q is an nxn positive semi-definite matrix, and R is an rxr positive 

_ _l/2 

definite matrix. Moreover, assume (A, Q ) is observable. 

With these conditions, it is well known [l], that the optimal complete 


state control for the cost (2.1.3) and the plant (2.1.1), is given by: 



13 


u* (t) 


R 


B* n x(t) = K*x(t) 


where n is solution of the Riccati equation: 


ha + a 1 n + q - hbr 1 B , n 


0 


and where we have defined: 
K* = - R B' n 


(2.1.4) 


(2.1.5) 


( 2 . 1 . 6 ) 


The expected optimal cost is in this case: 

A 

J* = tr(M ) (2.1.7) 

o 

whereas the closed-loop dynamics are governed by: 

x(t) = (At BK*) x(t),- x(0) = x (2.1.8) 

o 

being distributed as in (2.1.2). 

Note that even if E{x } = x and cov (x , x ) = £ , then the optimal 

o o o o o 

control is still given by (2.1.4). However the optimal cost becomes in 
this case: 

J* = tr S(x x’ + f ) (2.1.9) 

o o o 

Now for the main part in this thesis, we will be interested in using 
a particular canonical form for the plant (2.1.1) , in which the output 
matrix has the following form: 


C 


[ 1 


mxm 


° , , 1 

mx (n-m) 


( 2 . 1 . 10 ) 


\ 
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This is absolutely no loss of generality because having assumed that 
Rank C - m, one can always find an nxn non-singular matrix P such that 
C = CP ^ . However, the original system is not very often given in this 
particular canonical form, and it is interesting to be able to transform 
an optimization problem given 5, R, A, B, C into an optimization problem 
given Q, R, A, B, C such that the dynamics of the two closed-loop optimal 
systems are identical- The problem is then to find P such that C = CP ^ , 
where C is given by (2-1.10), and to relate Q and R to Q and R. 

A - Construction of j? 

From the condition: Rank C = m, 

the m n-dimensional row vectors c^, i = 1 , . ..m, are linearly independent. 

Define now the new state-variables: 


Write C = 


c , 

l 


m 


x . (t ) = c . x(t), i = 1 , ...m; x , (t ) = t . x(t), j = m + 1 , ...n 

11 3 J ( 2 . 1 . 11 ) 


where the t.'s are n-m n -dimensional row vectors, such that the c.’s and 
D i 

t_. ' s are linearly independent and form a basis of R n . 


Write T = 


m+i 

t, 

3 


n 


; then (2.1.11) can be written: 


x (t) = P x (t) 


x (t) 


T 


( 2 . 1 . 12 ) 
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By construction, the output relation for the plant (2.1.1) is now: 

y (t) = y (t) = C x ( t ) = CP -1 x (t) (2.1.13) 

where C is given by (2.1.10) and P by (2.1.12). 

Note that the matrix P is not unique, as T can be chosen arbitrarily, 
as long as the c. ’s and t 1 s are linearly independent and form a basis of 
R n . 

Under the similarity transformation P, the system (2.1.1) now becomes: 


x { t ) = A x(t) + B u ( t ) ? y (t ) = C x(t); x(0) - x^ (2.1.14) 

E{x .} = 0 ; E { x x 1 }— 1 (2.1.15) 

o o o o 


where we have the following relations: 


- -1 - - -1 

A = PAP ; B = PB; C = CP ; x = Px ; E = P£ P ? (2.1.16) 

o o o o 

B - Relations between Q, R, Q, R. 

For the new system (2.1.14), we would like to reformulate the optimi- 
zation problem we solved for the plant (2.1.1) associated with the cost 
(2.1.3). In other words, we would like to find the matrices 0 and R 
(respectively positive semi-definite and positive definite), such that the 


cost : 


J = E 


00 

j f tx' (t) Q x (t ) + u' (t) R u ( t ) } dt | 


(2.1.17) 


associated with the plant (2.1.14) leads to the same closed loop dynamics 


as those given by (2.1.8). 
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’We will give now a sufficient condition for this to be true. 

Theorem . Consider the two following plants related by the similarity 
transformation P: 


x(t) = A x(t) + B u ( t ) ; y(t) = C x (t) ; x(0) = (2.1.1) 

E{x } = 0; E{x x'} = E (2.1.2) 

o o o o 


associated with the cost functional: 


J 



CO 

{x 1 (t) Q x(t) + u 1 (t) R u ( t ) } dt 

o 


and 


(2.1.3) 


x(t) - A x ( t ) + B u (t ) ; y(t> - C x(t); x(0) = x (2.1.14) 

o 

E{x } = 0 ; E{x x 1 } = Z (2.1.15) 

o o o o 

where we have the relations: 

A = PAP” 1 ; B - PB; C = CP*" 1 ? x = Px ; Z = PE P 1 (2.1.16) 

O O O 0 

associated with the cost functional: 

00 

{x' (t) Q x (t ) + u' (t) R u (t ) } dt 

0 

Then a sufficient condition for the plant (2.1.14, 15) associated 
with the cost (2.1.17) to have the same optimal closed loop dynamics as 
the plant (2.1.1, 2) associated with the cost (2.1.3), is that: 

Q = (P ' ) _1 Q P _1 and R = R 




(2.1.18) 
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Moreover, in this case: 

n = (P 1 )” 1 fi P -1 and K* = K*P _1 
where we have defined 

k* = - r " 1 b' n 

and finally, the optimal expected costs are equal: 

J* = J* 

Proof. The proof is very simple and relies on the fact 
cient condition for A - BR B T JI and A - BR ^B'H to have the 
is that: 

A - BR _1 B'I[ = P (A - BR -1 B'n)P _1 
Now using (2.1.16), we can write: 

A - BR _1 B'II = P(A - BR~^B' P 1 JIP) p” 1 
so that an obvious choice is to take: 

R = R and II = (P') _1 TTP -1 
Now n is given by: 

n a + a f n + q - nBR -1 © 1 n - o 

so that plugging (2.1.22) in (2.1.23) and making use of (2.1 
{ P ’ ) ~ 1 nAP _1 + (P') _1 A’ HP -1 + Q - (P’) _1 fi’BR -;L B , nP~ 1 = 


(2.1.19) 

( 2 . 1 . 20 ) 

( 2 . 1 . 21 ) 

that a suffi- 
same eigenvalues 


( 2 . 1 . 22 ) 

(2.1.23) 
.16) yields: 


0 


(2.1.24) 
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A left multiplication of (2*1.24) by P' and a right multiplication 
by P yields: 


HA + A 'II + P'OP - nBR“ 1 B , ii = 0 (2.1.25) 

Now a comparison of (2.1.5) and (2.1.25) leads one to choose: 
e = <p*r 1 Q P _1 

Now obviously K* = - R can be rewritten as follows: 

K* = - R -1 B , P' (P 1 ) -1 ffp _1 , that is K* = K*P _1 . 

The equality of the two optimal costs is easily proved by expressing 
II and E q in terms of ff and E q , making use of the trace identity tr (AB) = 
tr (BA) . 

J* = tr(JIZ ) = tr {(P') _ 1 np _1 PE P'} 

0 o 

= tr {(P , ^ 1 ^E o P■ } = tr {(P’) -1 P'iiE o } 


= tr (fff ) = J* 
o 


Q . E . D . 


Remar k . Note that we have only here a sufficient condition. This 
comes from the fact that a sufficient condition for A and B to have the 
same eigenvalues is that A and B be similar, (i.e., A = TBT~^ where T is 
non-singular) . The condition is obviously not necessary as can be easily 


seen by choosing A = 



and 



We have now a complete picture of the effect of a similarity trans- 
formation on an optimal control design using complete state feedback. 


2.2 Observer Theory: Review of Results 

We will now briefly state some results about observer theory to intro- 
duce our notations. For a detailed proof of these results, the reader is 
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referred to the tutorial paper by Luenberger, [2]. 

A - Complete State Observer 

Theorem . Corresponding to an n-th order completely observable linear 
time invariant plant , an identity observer can be constructed having arbi- 
trary eigenvalues. 

Consider the linear time -invariant dynamical system: 

x (t ) = A x(t) + B u(t); y(t) = C x(t) (2.2.1) 

where (A, C) is a completely observable pair. 

We can define x(t), the estimate of x(t)/ such that the error: 

e (t) = x(t) - x (t) (2.2.2) 

has arbitrary dynamics. We will start from the dynamical system: 

X (t ) = F x ( t ) + L u ( t ) + H y(t) (2.2.3) 

and choose by comparison with (2.2.1) 

F = A - HC and L — B (2.2.4) 

where H is an nxm design matrix. 

Using (2.2.4) and subtracting (2.2.3) from (2.2.1)/ we obtain: 

e (t) = (A - HC) e (t ) (2.2.5) 

and x(t) is governed by: 


x(t) = A x(t) + B u(t) + H[v{t) - C x(t)] 


( 2 . 2 . 6 ) 
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New if (A, C) is a completely controllable pair, we see immediately, 
[3], that H can be chosen such that A - HC has arbitrary eigenvalues. 

A block diagram representation of the identity observer is shown in 
Figure 1 , page 23. 

B - Minimal Order Luenberger Observer 

There is a redundancy in the previous design, because if Rank C = m, 
then we already know exactly m linear combinations of the state, and we 
don't need to reconstruct all the n states* 

Theorem. Corresponding to an n-th order, completely observable, linear 
time invariant plant, having m independent outputs, a state observer of 
order n-m can be constructed having arbitrary eigenvalues. 

We have already seen in paragraph 2.1, that if Rank C = m, then we can 
always find a non -singular matrix P, such that after similarity transforma- 
tion the system {2.1.1) becomes: 

x(t) = A x (t ) + B u (t ) ; y (t ) = C x(t) (2.2.7) 


where C has the following structure: 
C = [I 


mxm 


0 / J 

mx (n-m) 


Define now x = 


*1 

*2 


where X]_ is an m-vector and X 2 is an (n-m) 
vector. With this definition, y(t) = C x(t) becomes y(t) = xj. (t) . 

Now partition accordingly A and B to obtain: 



- 

- 


” — 


All 

A 12 


Bl 

A = 

a 21 

a 22 

; B = 

A 
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where An is mxm, A| 2 mxfn-m) , A 2 x (ri“m)xm, A 2 2 (n-m) x (n-m) , mxr and 

B 2 (n-m)xr. 

<2.2.7) can then be written: 

Xi (t) = An Xi <t) + A 12 x 2 (t) + Bi u ( t ) 

x 2 (t) = A 21 xx (t) + A 2 2 x 2 (t) + B 2 u ( t ) (2.2.8) 

y (t) - xx (t) 

Note that we perfectly know xj (t) so that all we need is^ to estimate 
X 2 (t) . Rev/rite (2.2.8) in the following form: 

x 2 (t) = A 22 X 2 (t) + v (t) ; v { t ) - A 2 i x l (t) + B 2 u(t) (2.2.9) 

Y 2 (t) = Ax 2 x 2 (t) ; y 2 (t ) = xx (t) - A x x X I (t) - B L u (t) (2.2.10) 

and consider v(t) and (t) as new control and observation vectors for 
x 2 (t). Then we can apply the theory developed in A - to estimate x 2 (t) . 

Write x 2 (t) = F x 2 (t) + Lv(t) + Hv 2 (t) , and choose F = A 22 - HA 12 , 

L = I where H is (n-m)xm. 

We obtain: 

i< 2 (t) = A 22 X 2 (t) + v (t) + HA 12 [x 2 (t) x 2 (t)] (2.2.11) 

so that by subtracting (2.2.11) from (2.2.9) we have: 

e(t) = (A 22 " HA 12 ) e(t) (2.2.12) 


where 
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e (t) - x 2 (t) - x 2 (t) (2.2,13) 

It can be easily shown that if (A, C) is completely observable, then 
so is (A 22 * Ai 2 ). Then by appropriate choice of H, e(t) can be assigned 
arbitrary dynamics. 

Now from (2.2.10) we see that y 2 (t) - A} 2 x 2 (t) contains a differ- 
entiation of the data x^ (t) . To avoid this, define: 

z (t ) = *2 <t) - H XI (t) (2.2.14) 

It is then easy to see that z (t) satisfies: 

* 

z(t) = F z(t) + G x ^ (t) + D u(t) (2.2.15) 

where we have defined: 

F “ A 22 ” ^12 

G - FH + (A 2 1 - HA U ) (2.2.16) 

D = B? - HB 1 

A block diagram representation for this minimal order Luenberger 
observer is shown in Figure 2, page 23. 

Important Remark. Note that this observer depends only on a single 
design matrix, namely H. The main goal of this thesis will be to optimally 
determine this matrix, in some precise sense. 





Minimal or 
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CHAPTER III 

DESIGN PHILOSOPHY: PRESENTATION OF THE APPROACH 

3 . 1 Introduction 

We have seen in Chapter II, paragraph 2.1, that one can easily 
translate a complete state optimization problem, given a realization 
(A, B, C) of a plant and a cost involving Q and R, into another equivalent 
complete state optimization problem involving another realization (A, B, C) 
of the same plant, associated with a cost involving new weighting matrices 
Q and R. 

If we use the similarity transformation x(t) = P x(t) between the two 
realizations, then we showed that one should choose: Q = (P 1 ) 1 QP 1 and 

R = R in order to achieve the same design. 

With this result in mind, we will concentrate for the remainder of this 
chapter and Chapter IV on the following canonical plant: 

x(t) - A x(t) + B u(t); y (t ) = C x (t ) ; x(0) = x (3.1.1) 

o 

where x {an n-vector) is the state of the plant, 
y (an m-vector) is the output of the plant, 
u (an r -vector) is the control vector. 

We will assume furthermore that is a random variable specified by 
its first and second moments: 

E {x } = 0 E {x x 1 } = E 
o o o o 


(3.1.2) 
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In (3.1.1), A is an nxn matrix, B an nxr matrix, C an mxn matrix having 
the following canonical form: 

C = [I 0 , ■! (3.1.3 

L mxm mx (n-m) 


Assume also that (A, B) is a completely controllable pair, that (A, C) 
is a completely observable pair, but that the plant is not output stabili- 
zable, so that a dynamic compensator is required if we have only access 
to y (t) . 

Assume now that we are given the cost functional: 



{x 1 (t) Q x (t ) + u ' (t) R u (t) } dt 


(3.1.4) 


o 

where Q is an nxn positive semi-definite matrix, (A, Q 1 ^ 2 ) a completely 
observable pair, and R is an rxr positive definite matrix. 

Supposing we have access to the complete state, the optimal control 
is then given by: 


u* (t ) = K* x(t) = - R 1 B'll x(t) 


(3.1.5) 


where II satisfies the Riccati equation: 

TIA + A’Jl + Q - n BR - 1 B'n = 0 (3.1.6) 

The optimal closed -loop system is then: 

x (t ) = (A + BK*) x (t) ; y (t) = C x(t); x(0) = X q (3.1.7) 

E {x } = 0 E {x x' } = £ 
o o o o 


(3.1.2) 
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and the optimal cost for this design is: 


- tr(II£ ) 
1 o 


(3.1.8) 


Note that from (3.1.7) we have: 


(A + BK*)t 

x (t) = e x 


(3.1.9) 


and that, as a consequence, J* can be rewritten: 


J? = tr 


oo 

! fe (A + BK *>' t (0 + 


(Q + K* ’ RK* ) e tA + BK * Jt at I q ' 


so that II satisfies also: 


II(A + BK*) + (A + BK*) r n + Q + K* ' RK* = 0 (3.1.10) 

Now in practice we don't have access to x(t), but rather to y(t). 

What we intend to do is: 

- Use an observer to form £(t), an estimate of x(t). 

- Then apply the control u (t) = K* x(t), where K* is as in (3.1.5). 
This approach has been taken by many authors [ 4 ], [ 5 ], [ 6 ], and assumes 

a sort of "separation property". However, up to now, no precise answer 
has been given to the question: what are the optimal dynamics to be chosen 

for the observer part of the compensator? We will solve completely 
the problem in this thesis, and show that a separation property effectively 
holds. 

Impo r t an t Remar k : In [7] was recently given a partial answer to this 

question, assuming restricted possibilities of initial conditions for the 
plant and the compensator. The answer is derived, however, by identifying 
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the deterministic problem we are trying to solve with a stochastic steady- 
state control problem, and as such does not constitute a direct approach 
to the question. We will return later on the solution proposed by Newman, 

[ 5 ], [ 61 . 

3.2 Formulation of the Problem 

Define as we did in Chapter II, paragraph 2.2, 

A 11 A 12l [ B 1 

} B = 

a 21 a 22 _ ' b 2 

(The reader is referred to the previous chapter for the dimensions 
of vectors and matrices and the derivation of the minimal order Luenberger 
observer. ) 

In view of (3.1.3) and (3.2.1), we have then y(t) = C x(t) = x\ (t) , 
so that a minimal order Luenberger observer can be implemented to estimate 
X 2 (t) . We obtain: 

x 2 (t) = z(t) + H Xx (t) (3.2.2) 

z(t) = F z (t) + G Xi (t) + D u(t) (3.2.3) 

F — A 22 “ HAj2 

G = FH + (A 2 i - HA U ) (3.2.4) 

D = B 2 - HBi 

The error e(t), defined as: 



(3.2.1) 


e (t) = x 2 (t) - x 2 (t) 


(3.2.5) 
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satisfies the following differential equation: 

e(t) = F e (t ) (3.2.6) 

Define now: 


K* = [K* K*] (3.2.7) 

where K* is as in (3.1.5), K* is rxm, and K* rx(n-m). Noting that x(t), 
the estimate of x(t) is: 


x(t) 


x i<t> 

i 2 M 


we then want to implement the control u(t) 


(3.2.8) 

K* x(t), which becomes: 


u(t) = K* x 1 (t) + K* x 2 (t) 
that is using (3.2.2): 


(3.2.9) 


u(t) = (K* + K* H) x x (t) + K* z(t) (3.2.10) 

We can now use this value for u(t) in the equation (3.2.3) to obtain: 

z (t) = F z(t) + G x 1 (t) (3.2.11) 

where we have defined 

F = F + DK* 

2 

(3.2.12) 

G = G + D(K* + K* H) 


Assume for the present time that we start the compensator from z (0) = 0. 


We shall see in the sequel that this condition can be removed. 
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A block diagram of the overall design is shown in Figure 3, page 31, 
indicating clearly the two parts of the compensator. In Figure 4, page 31 , 
a compact representation is shown, corresponding to the compensator we 
will practically implement. 

Define now the augmented state: 

Ut) = (3.2.13) 

z (t) 

then using (3.1.1, 2) and (3.2.10, 11) the overall closed loop system is 
given by: 
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Now from (3.2.14), C (t) is given by 


5<t) = e* C 


(3.2.18) 


where C = 
o 


Z = E {(?'} = 
O 0 0- 


a, ( 

J ? (H) = tr j j 




since we assumed z (0) = o. Defining now: 
, then: 

i 


r e o 

o 


OJ 


}'t J 


fie dt Z 


° 1 


(3.2.19) 


so that if A is the solution of the following Lyapunov equation: 


% Or 

A $ + 4>’A + fi = 0 


(3.2.20) 


X. 


then (H) will be given bys 


J„(H) = tr (K Z ) 
2 o 


The increase in cost due to the use of a compensator will then be: 

A A 

A J (H) = of (H) - J* = tr(X Z ) - tr (n E ) 

2 1 O 0 

where J* is the optimal cost when we have complete state feedback. 

A 

^V, 

We explicitly showed the dependence of (H) on H. Our main goal will 
be to choose H so that Aj^ (H) is minimum. However, it turns out that even 
though the variables x(t) and z{t) are the natural variables to work with, 
as in practice we will implement z (t) , they are not suitable for actual 
pencil and paper calculations, as they lead to intractable equations. We 
shall then reformulate the problem in terms of x(t) and the error e (t) = 

(t) - x 2 (t) . 
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3 .3 Reformulation of the Problem 

We will work now in terms of x(t) and e(t). Note that 



r 


r- ~ 


— — 


x i (t) 


(t) 


xCt) 



— 


= [I -N] 



X 2 ^ 


x 2 (t) - e (t) 

nxn 

e (t) 


where we defined; 


N 


°mx (n-m) 

* (n-m)x (n-m) ^ 


(3.3.1) 


We can then rewrite the control u(t) = K* x(t) as: 


u(t) = K* x ( t ) - K* N e(t) 


(3.3.2) 


Define now the new augmented state: 
x(t> 


e <t) = 


e (t) 


(3.3.3) 


Then we can make use of (3.1.1, 2), (3.2.6) and (3.3.2) to write the over- 

all closed-loop system in the following form; 


lit) = r Ut); 5(0) =5 

o 


(3.3.4) 


where V is given by: 


r = 


A + BK* -BK* N 


0 


F 


(3.3.5) 


We note that this procedure is a formal procedure, as obviously we 
cannot implement u(t) given by (3.3.2), because we don't know x (t) but only 
y(t). However, we see immediately the theoretical advantage of using x(t) 
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and e(t): V given by (3,3.5) shows us clearly that the overall system 

will have the poles of the optimal closed- loop system assuming we have 
complete state feedback, associated with the poles relative to F. This 
wasn’t clearly apparent in view of $ given by (3*2.15). 

Now as previously we can evaluate the increase in cost. Using u(t) 
given by (3.3.2), we can express J 2 (H) in terms of 5 (t) as: 

CD 

I f r (t) a ? (ti at ! ( 3 . 3 . 6 ) 


V"> = s | j 


where we have defined: 


Q = 


Q + K* * RK* — K* ' RK* N 

-N’ K*' RK* N ' K* ' RK* N 


{3.3.7) 


Now 


from what we did previously, it follows from (3.3.4) that; 


5(t) = e rt 5, 


where we have = 


[::] 


Defining now H = E {E, £ ' } , J~ (H) can be rewritten: 
o oo z 


J 2 (H) = tr 


e r,t e rt dt 5 | 

° l 


(3.3.8) 


(3.3.9) 


so that if A is the solution of the Lyapunov equation: 


A T + T'A + Q. = 0 


(3.3.10) 


then J 2 (K) will be given by: 


J 2 (H) = tr (A E q ) 


(3.3.11) 
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We will now evaluate precisely J_ (H) . We then need to evaluate 5 and 

o 

A. For the computation of A, we will mainly follow Yuksel and Bongiorno, 


A - Evaluation of 5 

o 

Recall from (3.2.2) that x 2 (t) = z (t) + Hx^t), and from (3.2.5) that 
e(t) = x 2 (t) so that e(t) is given by: 

e ( fc ) = x 2 (t * “ H x 1 ( fc ) " z(t) (3.3.12) 

Now x q is such that E {x q } = 0, so that both E {x 1Q } = 0 and 
E ^ X 20^ = Si 1106 we have also assumed that the initial state of the 

compensator is zero (z (0) = 0) , we then obtain from (3.3.12) 

E = 0 (3.3.13) 
so that 


E 

o 


E {e e 1 } = HE H r 
o o 11 


+ E 


22 


HE 


12 


12 


H' 


where we have partitioned E^ as: 



Z, 

I 


11 

12 

z = 



0 

E* 

£ 


L 12 

22 J 


E 


11 ' 


£ 10 / having the same dimensions as A lW A , A 

±z z z 11 12 22 

Now E {x Q e^ } will be given by: 


“12 


E {x e'} 
o o 




. . 



_ 




x io 



*12 

- z n H, “ 

E 



e' 







o 

i 

E ^ 

- E' H 1 



20 



22 

12 







(3.3.14) 


(3.3.15) 


(3.3.16) 
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so that we finally obtain: 


o 


E { W 




(3.3.17) 


where £ is the initial plant covariance matrix, E is the initial error 
o o 

covariance matrix and is given by (3.3.16). 

Important Remark. Note that E q is a quadratic function of H, and not 
a simple positive semi-definite matrix. 


B - Evaluation of A 

We will now evaluate A, the solution of the Lyapunov algebraic equa- 
tion (3.3.10) where (2 is given by (3.3.7) and V by (3.3.5). 

Partition A as: 


A = 



12 


(3.3.18) 


where is (nxn) , A^ is nx(n-m) and A^ is (n-m)x(n-m). Then the solu- 

tion of (3.3.10) requires the solution of the three following equations: 


An (A + BK*) + (A + BK*)' A + Q + K* ' RK* = 0 (3.3.19) 

(A + BK*)' A + A 12 F - A BK*N - K* ' RK*N =0 (3.3.20) 

A F + F' A - A BK*N - N ' K* ' B ' A + N'K*’ RK*N = 0 (3.3.21) 

^ ^ ^ -L. jL* X ^ 

Now by comparison of (3.3.19) and (3.1.10), we see immediately that 
A^l = II. Using = II and K* = - R 1 B'll in (3.3.20) yields: 
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(A + BK*)’ A + A 12 F = 0 (3.3.22) 

Now we know that (A. + BK*) is stable and we desire a stable F, as F 
governs the dynamics of the error e(t). We then conclude that A + BK* and 
F are non singular, which implies = 0. 

Using now = 31 and A^ 2 = 0 in (3.3.21) yields: 

A 22 F + F’ A 22 + N ' K* ' RK*N = 0 (3.3.23) 


which has a unique positive definite solution, still because of the required 
stability of F. 

Finally, we see that A becomes: • 


A = 


JI 

0 



(3.3.24) 


so that using (3.3.17) for E , the cost J (H) can be rewritten: 

o 2 

J 2 (H) = tr(AE Q ) = tr(nZ Q ) + tr(A 92 E Q ) (3.3.25) 

where A 22 is the solution of (3.3.23) and E q is given by (3.3.14). Recall- 
ing that J^* — trdlZ^) we see immediately that: 


AJ (H) = J (H) - J* = tr (A„_E ) 
Z. X ZZ O 


(3.3.26) 


Note that A 22 depends on H through F, and that is a quadratic 
function of H. Define: 


M = N 1 K* ' RK*N 


(3.3.27) 


which is independent of H, the problem on hand is then the following: 
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given AJ 2 (H) = tr(A 22 E o ), where: 

A F + F'A + M = 0 (3.3.28) 

22 2 2 

F = A 22 - HA 12 and E q (H) = H' + - H (3.3.29) 

find H* such that V h ^ H* then 

aj 2 (H*) <_AJ 2 (H) 

Remarks 

1. The approach we have taken here, considering the variables x(t) 

and e(t) is not original. in particular, Yuksel and Bongiomo [4] showed 

also that AJ 2 (H) = tr(A 22 E o ). However, up to now people have considered 

E as a positive semi-definite matrix independent of H, and consequently 
o 

have tried to minimize simply trCA^). Now from (3.3.28), we see imme- 
diately that this philosophy leads one to choose F having greater and 
greater negative eigenvalues. Some authors then asserted that AJ^ (H) 
will go to zero as the eigenvalues of F are pushed towards infinity. It 
was soon shown by counterexamples [8] > [9], [10] that this was obviously 
wrong in some cases. However, no precise solution was given to find the 
optimal H*. 

2. Newman in [5], [6] uses the natural variables x(t) and z (t) to 
tackle the problem. However, he does not take his plant in the canonical 
form we have chosen. As a consequence, he ends up with a formidable set 
of non-linear matrix equations which are not readily solvable, and his 
conclusion is that the best control is given by u(t) = Kx{t) where f 
This conclusion is wrong, and his error was also recently noticed and 


K* . 
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corrected by Rom and Sarachik, [7], 

3* What we simply noticed is that E is in fact a quadratic function 

o 

of H, and that instead of trying to reduce the "size" of A , we must 
minimize the function: 

AJ_ <H) = tr {A (H) E (H) } 

2L 2L O 

The problem is now well posed and we can give a precise answer, leading 
to an optimal design with minimum increase in the cost. 
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CHAPTER IV 

THE OPTIMAL COMPENSATOR 


4.1 Statement of the Result 

We want now to solve the following minimization problem: 

Given solution of the following Lyapunov equation: 

A 22 F + F'A + M = 0 (4*1-1) 

where : 


F = R 22- HA 1 2 (4 - 1 - 2) 
find H* such thatV H ^ H* 

aj 2 (h*) <_aj 2 (h) 

where 

AJ 0 (H) = tr (A„_E ) (4.1.3) 

2 22 o 

and E q is given by: 

E (H) = HE H' + £ - HZ - E' H 1 (4.1.4) 

o 11 22 12 12 

Recall that E^(H) is positive semi-definite, being the initial covariance 
matrix of the error. 

We will now state the following theorem. 

Theorem. 1. Assume that the initial output covariance matrix I 
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is non-singular. Then there exists a unique minimizing H for the incre- 
mental cost (4.1.3), namely: 


H* 


12 


v i \ 
"12' 


v- 1 

"11 


where W is the unique positive definite solution of the Riccati-type equa- 
tion: 


A W 
*22 


WA 22 


+ E 


22 


- <*12 


E' ) 
12 ; 


r -l 

"ll 


(WA 


12 


E i 2 > 


= 0 


2 . Assume now that is singular: then no minimizing H exists, and . 

the incremental cost (4.1.3) can be made arbitrarily small by appropriate 
choice of H. 

We will give now some preliminary results that we will need to prove 
the theorem. 

As we are looking for a stable F, we can write from (4.1.1): 


22 


' “a'V 1 «22-®12 ,t 

e Me dt 


(4.1.5) 


Recall that M does not depend on H. The cost (H) can now be rewritten. 


us in 


inrf +■ Kd +• 


race identity tr (AB) = tr (BA) : 


( 

AJ (H) = tr M . 

( J 


*22 ‘*"12 (A 22 HA 12 ) ' t ) . 

E (H) e dt / (4.1.6) 

o 


Note that Aj^ (H) is a real function of (n-m)xm variables (i.e., the 
h^’s, w ^ ere H = [^jD* A necessary condition for H* to minimize such a 


function is that — AJ (H) 
dH 2 

the derivative. 


= 0. We shall simply calculate and evaluate 


H* 


A key lemma in this calculation is the following due to Kleinman, [11]. 
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Definition: g(.) is a trace function of the matrix X, if g(X) is of 

the form: 


g(X) = tr[G(X) ] 

where G(.) is a continuously differentiable mapping from the space of rxn 
matrices into the space of nxn matrices. 

Lemma: Let g{x) be a trace function. Then if we can write: 

g (X + e AX) - g (X) = z tr[Q(X) AX] 

as e 0, where 0(X) is an nxr matrix, X is an rxn matrix, then we have: 

f- g (x) = Q 1 (x) 


The calculations we will do follow closely those of Levine, [ 12 ] • But 

let's first show a simple application of this lemma. 

* A 22 ~ ^ 12 ^ 

Example : Let G(H) - e .We want to compute: 


dH 


9 (H) = | tr[G(H)] } 


We have 


G(H + e AH) = e 


(A 22 - t » 12 - e AHA 12 )t 


(4.1.7) 


But from p. 175, reference [13], we have that to the first order in e: 

t 


, „ , iA 22 - “ll 11 

G(H + £ AH) = e - £ 


(A 22 - HA 12 )(t - 0> 


(A 22 - Itt 12 )a 
AHA e da 

12 


(4.1.8) 


\ 



42 



W e can now proceed through the proof of the theorem, with the lemma 
and example for guidance. Statement 1 is proved in Section 4.2, and 
Statement 2 of the theorem is proved in Section 4.6. 
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4-2 Proof of the Result. I non-singular- 


Recall from (4-1-6): 


AJ (H) = tr M 

( l 


(A 22 - ^ 12 ^ ' (A 22 
e E (H) e 

o 


m n yt 


at (4.2.D 


where : 


E (H) = HE H’ + 1 - HI - 1 1 H' 

o 11 22 12 12 


(4.2.2) 


We want to compute the quantity AJ^ (H + eAH) 
apply Kleinman's Lemma. 

From (4.2.2) we have to first order in e: 


AJ 2 (H) in order to 


E (H + eAH) = E (H) + e AH (I H 1 - I ) + e( Z-.H' - I_ 0 )' AH* 
o o 11 12 11 12 


(4.2.3) 


and from (4.1.8) , also to the first order in ei 


<*22 - m i2 - e4Hft 12 !t Pt 

e = e _ e 


e F<t a) AHA 12 e F ° da (4.2.4) 


so that Aj^ (H + eAH) is given by: 


AJ 2 (H + eAH) = tr j M 


(F - eAHA )t (F - eAHA ) ' t 

E (H + eAH)e dt 

o 


(4.2.5) 

Using (4.2.3) and (4.2.4) we can now rewrite (4.2.5) retaining only the 
i first order terms in e: 


Aj ? (H + eAH) = tr j M 


Ft F ' t Ft F 1 1 

e E (H) e + ee AH (I H r - E..)e 
O 11 1 2 


Ft F’t F (t - a) Fa F‘t 

+ ee ( I H 1 - JhJ' AH’ e - e e v AHA^e °do E (H)e 

11 12 12 o 


ee Ft E (H) [ e F ' (t 0) A' AH' e F ' ° do 1 dt j 

° i 12 J 


(4.2.6) 


0 
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Now using tr (AB) = tr(BA) and subtracting (4.2.1) from (4.2.6), we 
obtain : 


_ ( f r 

AJ 2 (H + eAH) - J 2 (H) = £ trj M 2 ( E , , H ' - E„„) 


11 ‘ 


J 12 


A.. e F ° E (H) e 1 " 1 e F(t a) doAH 
12 o 


-I* 


e F ^ ~ E (H) e F t e F ° do AH 1 dt 
12 o 


F’t Ft 
e e AH 


So that we have by application of the lemma: 


dH AJ 2 (H) 


F 1 1 Ft 

e e (E 11 H ' - V dt 


0 

oo t 


J 


dt 
o o 


- F' (t - a) Ft F'c , , 

da e e E (H) e A* 

o 12 


w 

-I 


dt 

0 0 


, F’o Ft F' (t - a) , 

do e e E (H) e A' 

o 12 


M (4.2*7) 


Now consider 


0 = 


« t 

r 

dt 

o o 


F'a Ft ^ F * (t - a) 

da e e E .(H) e 
o 


and make the following change of variables: 


o = t - o^j da = - da^; a = 0, = t; a = t, a = 0 

00 t 


then 0 becomes 0 = 


dt 

o 0 


F' (t - O ) Ft . . P °1 
da e 1 e E (H) e 

1 o 
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So that (4.2,7) nov 7 becomes; 


d 

dH ”2 


AJ„(H) = j 2 1 e F<t e Ft (E il H’ - E^) ' dt 


(' 


«> t 

r 

dt 
o o 


, F’a Ft 
da e e E (H) e 
o 


F ' (t - o) | 

A ' ^ / M 


■H 


, F'a Ft , . 
da e e E (H) e 
o 


12 I 

F' (t - a) 


00 t 

Consider now x 

0 0 

we will interchange the order of integrations in x to obtain: 


X = 


GO 00 

r 

da 


1( F'a Ft F' (t - a) 

dt e e E (H) e 
o 


o a 


We will now make the change of variables; 


( 4 . 2 . 8 ) 


t - t - a; di = dt; t - a, T « 0? t = °®, t 

OO 00 

da 


X = 

o o 
or finally 


F 1 o Fa Ft F x 

dx e e e E (H) e 
o 


X = 


F’a Fa , 
e e da 


Ft F 1 T 

e E (H) e dt 
o 


so that (4.2.8) can be rewritten: 


dif 1J 2 (H) 


= 2 


F 1 1 Ft 
e e dt 


['V - 


? ) ' 

L 12’ 


Ft F' T 

e E (H) e dx A' 

O 12 
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We will now set — AJ_ (H) = 0, to obtain 
dH 2 


he u - e ; 2 - 1 /T v hi /,T at *; 2 


(4.2.9) 


Define now 


W = 


Ft F 1 t 

e E (H) e dx 
o 


(4.2.10) 


Then if we assume that is non-singular, (4.2.9) can be rewritten: 


H* - <W * I i2 ) 


(4.2.11) 


We see from (4.2.10) that W satisfies the algebraic Lyapunov equation: 


FW + WF' + E (H) = 0 

O 


(4.2.12) 


Now recalling that E q (H) = HE H' + Z - HZ 12 - E^H' , that F = A 22 - HA , 
and making use of (4.2.11), we can rewrite (4.2.12) as: 


*22“ + “*22 + E 22 - ( "*i 2 * E i 2 > E li < Wa i 2 * E ;2>' ‘ ° < 4 - 2 - 13 > 


Note that E q (H> is positive semi -definite, and that we are looking for 
a stable F, so that W, the solution of the Lyapunov equation (4.2.12), is 
positive definite, and unique from the linearity of the Lyapunov equation. 
Equivalently, W is the unique positive definite solution of the Riccati- 


type equation (4.2.13). 
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This uniqueness of H* is of prime importance , as it insures, that 

given A, B, C = [I 0 , J and the cost (3„1.4)., there is only one 

mxm mx(n-m) 

optimal compensator for our problem and that its realization is completely 
determined by Eq. (3.2.10, 11). We will see in Chapter V that when the 
output matrix C of the plant does not have this canonical form, then a unique 
optimal compensator no longer exists, but rather a class of optimal compen- 
sators (all of them leading to the same increase in the cost) can be pre- 
cisely determined. 

We can compute now the minimum increase in the cost, which will be 
given by: 

AJ* = tr (A* E*) (4.2.14) 

2 22 o 

where is the solution of 

A 22 F * + F *' A 22 + M = 0 (4.2.15) 

in which F* = A„ - H*A and where E* in (4.2.14) is given by: 

22 12 o 

E* = H*E., H* 1 + £ 0 - £' H* f (4.2.16) 

o 11 22 12 12 

Important Remarks. 1. Note that H*, through W and (4.2.11) depends 
uniquely on the initial state covariance matrix Z^, (i.e. , Z ^]_2 ' ^22^' 

We will see clearly with an example, that if we take two different initial 
state covariance matrices, one "small" and another "large", reflecting 
two different degrees of knowledge of the initial state, then F* = A 0? - 
H* A i2 7 which governs the dynamics of the error x^ (t) - x 9 (t) , will have 


/ 
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much faster dynamics when we take a large 3C q . This corresponds to what one 
would expect, the initial error (large in this case) being quickly reduced 
to allow the observer to still track closely (t) and achieve a good control. 

2. The independence of H* with respect to the closed loop dynamics 

of the optimal system assuming complete state feedback, justifies completely 
the sort of separation property we assumed in our design, namely: estimate 

the state, and then apply the control u(t) = K*x(t). 

In other words, we can change Q and R in the cost (3.1.4) so as to have 
a better and better regulation of the state, but if we still assume the same 
initial state covariance matrix for the plant, then H* will be the same for 
all designs. The estimator-observer part of the design is independent of 
the control part., showing clearly that an equivalent of the separation 
property holds for this problem. 

3. Note that there may be solutions of (4.2.12) which are not solutions 
(4.2.10). We then need to show that there exists a stabilizing H for F = 

A - HA ; if this is true then W will be the unique positive definite solu- 

/ -L ^ 

tion of (4.2.13) . 

We will simply compare our problem with a filtering problem whose solu- 
tion is known. Namely, consider the following observable, stochastic linear 
system: 

x 2 (t) = A 22 x^(t) + £ (t) 
z (t) - A x 2 (t) + n (t) 

where £ (t) and n(t) are Gaussian processes such that: 


E { £ (t ) } - 0 


E (n(t)} - 0 
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' E (5 (t)e* <T) > = z 22 6 <t - T > ; E(n(t)n , (T)} - 5(t - t) 

E {n(t)? 1 (t)} - Z 12 6(t — T ) 

for all t and t, and where > 0 and 

Then it can be readily seen that (4.2*13) and (4.2.11) are precisely 
the equations for the error covariance matrix and gain of the steady-state 

Kalman filter we would implement to obtain an estimate of x 2 (t) from the 

noisy measurement z (t) . 

Now, the dynamics of this steady-state Kalman filter would be: 

i 2 <t) - (S 22 - ha 12 ) i 2 (t) ♦ ha 12 «<t> 

and it is well known that with the assumptions we have made about % 22 ' 

and the observability of the pair (A^, A 12^ ' the matrix F ” 
governing the dynamics of the filter is stable ([30], [31]). 

We can now return to our problem and conclude that there exists a 
unique minimizing H given by (4.2.11). This H will lead to a stable F = 
and consequently insure that (4.2.10) and (4.2.12) have the 
same solution, if we take for W the unique positive definite solution of 


(4.2.13) . 
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4.3 Generalizations - Z Mon-Singular 

Up to now, we have considered only the initial conditions: E {x^} = 0, 

E {x x 1 } = Z , z =0 (see Chapter III, Paragraphs 3.2 and 3.3). Neverthe- 
o o o o 

less, it may very well happen that we know that x q is always disturbed in 
a particular way so that E {x^} = 'X r ^ X Q ^ 0) . It may also happen that 
for a practical reason we cannot start the compensator from z^ = 0. 

We will see now that the generalizations to handle these particular 
initial conditions are very simple. We will still assume that Z^ is not 
singular . 

Generalization 1: z ^ 0; E {x } = 0; E {x x 1 } = Z 

o o o o o 

Under these conditions e = x„ - Hx, _ - z is such that: 

o 20 10 o 

E {e } = - z ; E {e e*} ~ E (H) + Z 
o o o o o o 

where E (H) is as previously and Z = z z 1 „ We see that the complete 
o o o o 

demonstration goes through replacing E q (H) by (H) + 

The optimal H is then given by: 

H* = (W A’ 2 + Z , 12 ) Z^J (4.3.1) 

where W satisfies now: 


A 22 * + ” A 22 + Z 22 


+ z - 


A 

(W 


12 


l 1 ) 

12 


.-1 

► 

"11 


A 

(W 


A 1 
12 


y r ) 
h 12 } 


= 0 


(4.3.2) 


It is clearly seen from the previous equation giving W f that starting 
from z^ / 0 is in fact equivalent to adding extra uncertainty in the value 
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of x . This makes sense because if we know that E {x } = 0, we should 
20 o 

start from E {e Q } “ 0. As we will see in an example, this extra uncertain- 
ty will imply a faster response of the optimal compensator, to recover 
more rapidly from the initial larger expected error. 

The increase in the cost is given in this case by: 

t\j '\j r v, 

AJ* = tr (A* E*) 

2 22 O 

where: 


E* = H* £ H* 1 + E _ + Z - H* E - Z* H*’ 
o 11 22 o 12 12 


(4.3.3) 


"o 

and A* 2 is the solution of: 


A 22 F * + F *' A 22 + M = 0 


(4.3.4) 


in which F* = A 22 - H* A.^* 

Generalization 2. E {x } = x , cov (x , x ) = Z , x x' = X 
o o oo oooo 

Assume however that we can start the compensator from - H x^; 

this assumption insures E {e } = 0 and E{ee , }=E as previously, 

o o o o 

(4.1.4) . 

The optimal compensator will then be the same as for the general 
derivation , due to the fact that E (H) is the same. In this case, H* is 
then given by (4.2.11) where W satisfies (4.2.13). The increase in cost 
is then also the same and given by (4.2.14). 
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Note that these results are due to the fact that we assumed the 

possibility of "tuning" z q to take the value z^ =* - H*x^ Q . 

Generalization 3. z ^ 0 arbitrary, E {x } = x , cov (x , x ) = I , 
o O O o o o 

XX* = X , z z ' - Z . 
o o o o o o 

Under these conditions, we obtain: 

E { e } = e = x - Hx - z 
o o 20 10 o 


E {e e’} = E (H) = H{E - + x. _x’ ) H* + E + Z + x ort x' - x„ z* 
o o o 11 10 10 22 o 20 20 20 o 


- H(E 12 + x 1Q x^ 0 - x 1Q z^) - (E 12 + x 10 x 20 “ X 10 H ' 


Define now 


E n E n + X io x io 


E. = E,_ + x x' - x z 1 
12 12 10 20 10 o 


Z 22 = + Z o + X ™ X ™ ' x ™ z : 


then E (H) = HE H' + E„„ - HE, _ - E’ H' 
o 11 22 12 12 


The answer is then immediate: 


H. = (5 A i2 * S i2 ) E-J 


where W satisfies: 
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a 22 5 ♦ 5 a 22 + s 22 - (ii a ; 2 * f ; 2 j f ii <* a ; 2 * f i 2 > ■ - 0 


The increase in cost is given by: 


AJ* = tr (A* E*) 

z zz o 


where A* is the solution of 


l 22 + X 22 + M = ° 
with F* = A 22 - H* A and 


E* 

o 


H* Z H* 1 + 




From this study of different initial conditions/ we conclude that 

whatever the value of E {x } is, the minimum increase in the cost will be 

o 

obtained if we can "tune” the initial state of the compensator so as to 

insure E {e } = 0. If we cannot set z to this desirable value , the 
o o 

increase in cost will be bigger/ and the optimal compensator faster, to 
reduce more rapidly the initial bigger expected error. 
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4 . 4 An Example* Z Non-Singular 

Assume we are given the completely controllable, completely observable, 
non output stabilizable, linear time' invariant system: 


_ — 


r 


p 




p 

X (t) 


0 1 


x, (t) 


0 


x, {t) 

1 

= 



1 

+ 


u(t) y (t) = [1 0] 


X (t) 

_ * 


0 0 


_ x 2 (t) . 


1 




Note that this system is already in the canonical form we need. 


Assume E 


r 9 

0 

and p = 


l_o 

10 

feedback is: 



0 E {x x * } = I - 
o o o 


"l 0“ 

_ o i_ 


and also that 
so that the optimal control assuming complete 


r = 1 
state 


u* (t) 


-3-4] 


x x (t) 
X 2 (t) 


i.e., K* = [k* k*] = [ - 3 - 4] 


leading to the optimal closed -loop system: 


(t) 


0 

\ 

r~H 


x 1 (t) 

/2 (t> . 


-3 

-4 


X (t) 



_ 

_ 


_ 


which has two stable modes = - 3 , = - 1. 

The optimal cost is 


J* = tr(III ) = 16 

1 o 
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where II is given by: 


H = 


12 

3 


3 

4 


Suppose now that we don’t have access to x^t) and x 2 (t) but only to 
y(t) = x^(t). A dynamic compensator must then be constructed. 

1. Assume we start the compensator from z^ = 0. The general deriva- 
tion then applies. From (4.2.13) w is computed to be: w = 1, so that 
from (4.2.11) : h* = 1- 

From (3.2.3, 4) we then obtain 


z(t) = - z (t) - x^(t) + u(t) 


(4.4.1) 


and from (3.2.10) 


u (t) = - 7 x <t) - 4 z(t) 

so that the "compact" optimal compensator becomes: 


(4.4.2) 


z (t) = - 5 z (t) - B x^(t) 
u (t ) = - 7 x (t) - 4 z (t) 

leading to the overall closed loop system, 


■x x <tf 


1 — 

O 

(—1 

o 


-X x (t)- 

*2 (t) 

= 

-7 0 -4 


x 2 (t) 

_z(t) _ 


-8 0 -5 


z(t) 

^ — 
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whose modes are s n = - 3 , = - 1, and s_ = - 1. As expected, and 

12 3 12 

correspond to the poles of the optimal closed loop system assuming complete 

state feedback, and corresponds to the pole associated with f* - ~ 

3 22 

h*a^ 9 = - 1 governing the error* Namely, we have e(t) = - e(t). 

We see that for this optimal design the error because of the "small" 
value of does not decay really fast, contradicting all the heuristic 
arguments in favor of a fast observer* 

The increase in cost is given by: 


AJ* 


A* 

22 


E* 

o 


= 16 


where E* = 2 from (4.2.16), and A* - 8 from (4.2.15) in which M = 16* 
o 22 

(Recall M = N'K*' RK*N, and in this example N = jj’J , K* = [ - 3 -4], 

r = 1). 

Now for this simple academic example, we can compute easily AJ (h) = 

2 

*„(h) E (h). 

2^ o 

From (4.1*1) f we have A (h) = ~ and from (4.1.14) E (h) - h 2 + 1 , 

22 h o 

so that : 

A J (h) = 8(h + - ) 
z n 


Recall that the optimal cost assuming complete state feedback is 
J* = 16; we can then very easily evaluate the percentage of increase in 
the cost due to the fact that we had to use a compensator: 
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T (h) 


AJ 2 (h) 





A sketch of t (h) as a function of h is shown in Figure 5, page 60- 

We see clearly on this sketch that h* - 1 leads to the minimum percent- 
age of increase (in this academic example, 100%), and that a too small or 
a too big h lead to high increases in the cost, one because the estimation 
of (t ) is very poor leading to a poor control u(t) = K*x(t), the other one 
because too much effort is spent to do an unnecessarily good estimation of 
x 2 (t). 

2. Assume now that for some practical reason, we cannot start the 
compensator from = 0, but instead that = / 5 \ 

Obviously this is a real disadvantage because now E {e^} - - ^3 
instead of zero. We then impose on the observer part of the compensator to 
start with a large initial error. We would then expect the optimal compen- 
sator in this case to be faster than in the previous design where E {e^} “ 
This is precisely what happens. 

From (4.3.2) and (4.3.1) we have w = 2 and h* = 2 so that f* = - 2 and 
for this design e(t) = - 2 e(t). 

The equations for the compensator are: 


z(t) = - 2 z(t) - 4 x^t) + u(t) 

u(t) = - 11 x^(t) - 4 z (t) (4.4.3) 


so that we will implement the compact compensator: 
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z(t) = - 6 z(t) - 15 x^(t) 

associated with (4.4.3). 

The overall optimal system becomes: 


ft 


0 

1 

o' 

x 2 (t) 

= 

-11 

0 

-4 

_z(t) . 


-15 

0 

-6. 


corresponding to the following modes 



The increase in cost is in this 
Aj; - 32 


'x^ (t) 

x 2 (t) 

_z (t) _ 

= - 3, Sj = - 1# associated with 
case: 


As previously, we can draw a sketch of the percentage of increase 
-v A3 2 (h) 

t (h) = — ~ — f {see Figure 6, page 60.) 

Wo have in this case from (4.3.3) and (4.3.4): 


^ 8 Or 2 

A_ 0 (h) = - and E (h) = *i + 4 
2. 2 Jl O 

so that x (h) = (h + i ) . 

2. h 

We see that now the minimum percentage of increase is 2 00%. It is 

then very important to be able to start the compensator so as to insure 

E {e } = 0. 
o 
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Remark* One should not attach too much importance to the numbers in 
this academic example , which has been constructed so as to insure integer 
values of all the parameters. We will see in Chapter V another example 
where the percentage of increase is of the order of 10 

3. Assume now that £ = W ? I but that z =0. 

o 0 4 o 

The uncertainty in x ^ is now bigger and one would expect a faster 
compensator than the one found in 1. 

This is exactly what happens, and it can be seen that w = 2 and h* = 2 
for this example, so that f* - - 2. 

The increase in the cost is now AJ* = 32. 
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4 . 5 Proof of the Result- ^ Identically Zero 

We will return now to the proof of the main theorem, and study the 
case where £ is identically zero, before we tackle the case where T . ^ is 
simply singular. 

Assume E = E t x j_Q X ^Q^ = As we aiso assumed E {x^} = 0, this means 

that x = 0 with probability one. We will simply say that we know perfectly 
that x = 0, Note that under these conditions, the initial covariance 
matrix takes the form: 


l - 
o 


0 



(4.5.1) 


If we still assume z =0 for the compensator, then from (4.1.4) , we see 
that now: 



and E does not depend any longer on H. 
o 

The increase in cost becomes: 


(4,5.2) 


AJ 2 (h) = tr(A 22 E 22 ) (4.5.3) 

where is still the solution of the Lyapunov equation: 

A 22 F + F * A 22 + M = 0 (4,5.4) 


It should be obvious that the only way to reduce AJ^ (H) is now to try 
to make tr(A^ 2 ) as "small 15 as possible- We will show that for this parti- 
cular case of perfect knowledge of the initial state x ^q' there no 
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minimum for AJ^ (H) , and that the increase in cost can be made arbitrarily 
small, by choosing F to have greater and greater negative eigenvalues , and 
consequently choosing a "greater and greater" H. 

From (4.1*5), we have 


(A 22 HA 12 ) ' t (A 22 HA 12 )t . 
A 22 = I e Me dt 


so that AJ 2 (H) = ti: (A 22 ^2'>^ becomes: 


aj 2 (H) 


tr |" J 


' W 22- H “l2 )t r <S 22 - HA 12 ) ' t „ I 

Z 22 6 dt j 


We will now give simply a few indications for the demonstration as we 
will mainly follow what we have previously done. 

To the first order in e, we have: 


AJ^(H + eAH) = tr{ M II e Z 22 e - e 


uu 

J[ 


Ft 


F’t 


F(t - 0) a tjt* F0. _ F*t 

e Aha i2 e do Z 22 e 


Ft _ 
- c e Z 


22 


F 1 (t - a) f Atll F 1 a 1 , 4 
e A^ 2 AH' e da dt 


We see immediately that all the corresponding steps will be the same, 
where we will set E^ = 0 and I = 0 in the equations. 

We finally end up with: 


d_ 

dH 


aj 2 (h) 


F't Ft 
e e dt 


Ft F 1 1 

e E 22 e dt A’ 2 M 


(4.5.5) 
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Define U by 


i F f t Ft 
U = e e dt 


(4.5.6) 


then U is a solution of the algebraic Lyapunov equation: 


UF + F f U + I = 0 


(4.5.7) 


Define V by: 


V = 


Ft _ F't 
e S 22 6 dt 


(4.5.8) 


then V is a solution of the algebraic Lyapunov equation; 


VF * + FV + E ^ ~ 0 (4.5.9) 

Note that I is positive definite, that is positive semi-def inite , 

and that we are looking for a stable F, so that solutions U and V are 
required to be positive definite. 

We can now rewrite (4.5.5) as 


d 

— AJ (H) = - 2U V A'. M (recall M >_ 0) 

ClH 2 1.2 

where U > 0, V > 0, and are only functions of H through F = A ?0 - HA^ - 
We then see clearly that AJ^ (H) is a monotonically decreasing function of H, 
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and that no effective minimum exists. 

Mathematically , the increase in cost can be made arbitrarily small. 
However, in practice, we won’t implement a compensator which is "too fast", 
because in this case it will tend to act as a differentiator, and small 
measurement noise or plant noise (not modeled in this deterministic approach) 
will give rise to spurious signals. 

Generalization. Identically Zero 

Assume E {x } = x , that is E {x in } = x A E {x } = x , but still 
O O 1U JLU zu zU 

cov(x^ Q , = 0 and cov (x^, x^q) = ^22* ^ans that we know 

perfectly that x^ will take the value 

Define x 22 - ^ Z c - z, 

We see immediately that the error is such that: 


E {e } = x or . - H x - z 
o 20 10 o 

There are two possibilities here: either we can tune z^ to take the value 

“ H so that E {e^} = 0, and the design will be the same as 
previously (no minimum, and possibility of making Aj^ (H) as small as we want) 
or if is arbitrary but fixed, we obtain: 


E (H) — H X , 
o 11 


H' + E 


22 


+ X 


22 


+ Z - H(X 10 - x 
o 12 


10 


z 1 ) - 
o 


(X 


12 


- x 


10 


Z o y 


H' 


so that E (H) appears again as a quadratic function of H. The answer is 
immediate: if is non-singular (i.e., no component of x^ is zero), then 

there will be a unique minimizing H* . 

Define 5 12 - X 12 - z^, ^ - E 22 + x 22 + Z o . Then H* will be given 

by: 
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H* 


- <w A h 


x ; 2 ) 


■it 


(4*5.10) 


where W satisfies: 


22 


W + WA 


22 


+ x 22 - (wa ; 2 


x ia ) 


‘it 




+X i2 1 ' * 


(4.5.11) 


Note that there exists a unique minimizing H* as soon as E^ (H) is a 
quadratic function of H, if the quadratic term possesses a positive definite 
matrix coefficient. 

Example: We will show now what happens to the example in (4.4.1)/ 

page 55, if we take = 0, that is ^ J / with still E {x^ Q } = 

E {x 20 } = °* 

It is easily seen that in this case E q = a ^ = 1 and that as previously 

Q 

A 22 (h) = h ' S ° that: 

Si 2 ,h> = I 


The percentage of increase becomes now: 


t (h) 


AJ 2 (h) 


J 1 


1 

2h 


and can be made arbitrarily small by choosing sufficiently large h. 

The engineer is then completely free to choose the dynamics of the 
compensator. Other practical considerations will lead to choice of H* . 

Assume now E {x 1A l = 1, E {x } = 0, z = 0. It is easily seen that 
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2 

E (h) - h + 1 in this case, leading to a unique minimizing h, for 

° 2 

AJ ,(h) = 8 - . 

2 n 

A 

Namely from {4.5.10, 11) we have w = 1, h* = 1 and the optimal 
compensator follows . 
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4.6 proof of the Result. Z is Singular 

We will assume now that Z^ is singular but not identically zero. 
Without loss of generality , we can rearrange the components of x^<t) to 
read: 


x x (t) 
x i ^ 

where x^t) is an m' vector (m f < m) and (t) an m-m f vector , such that 
E {x^ } - 0 and x^ Q is perfectly known to be zero. Accordingly Z^ becomes 


x x (t) = 


where 



is an m v xm f 
As a consequence 


0 

0 

non- singular matrix. 

' E {x 10 x 20 } = E 12 bec ° meS: 


z 


12 



where Z^ is an m’x(n-m) matrix. 

The global initial state covariance matrix becomes now: 


Z * 
o 


11 


12 


0 

0 

0 


12 


22 


Given this new partitioning of Z and Z^ , 
tion H and A accordingly. 


it is interesting to parti- 
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Define H = [H^ H^] where is (n-m)xm 1 and is (n-m)x (m-m 1 ) ; 

define also: 



where A is m'xfn-m) and A^ 2 is (m-m 1 )x (n-m) . 

Note that with these new definitions (H) becomes, (as can be readily 
seen) : 


V B > - "l hi H i + £ 22 - H 1 f 12 - hi H i 


(4,6.1) 


so that in fact E^ (H) is simply a function of H^. From this simple fact, 
we can immediately predict that there will be no effective minimum in this 
case either. 

Assume first that H = [H is fixed . We can still solve for W 

given by (4.2.12) where now E q (H) = E q (H^); namely we have: 

FW + WF 1 + E _ ( H 1 ) = 0 (4.6.2) 


in which F = A 22 - HjAjj - H 2 A 12 

Assuming W > 0 is found from (4.6.2), let's see what happens to the 
necessary condition for a minimum given by (4.2.9): 


(4.2.9) 
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Namely we have now: 


H £ 

- E * 

0 " 

_ 

r wa * . 

'b -i 

WAJ J 

1 11 

12 



12 

i2 J 


We no longer have the possibility of satisfying (4.6*3) , except in 
the very special case: A^ ” 0- briefly study this case before 

we proceed for the general case. 

'Xf 

Special Case: A 12 = 0. Note that F becomes accordingly F A 22 

and does not depend on ■ As from (4.6.1), does not appear in E^, 
we will simply set H 0 = 0. H* is now simply given by H* = [ H* 0], where 
we have : 


H i 


= (W, 


A ' 

12 


v » \ 

12 } 




being given by: 


*22 W 2 + W 1 A 22 + E 22 


<“l + 42> (W 1 + f i 2 > ' " ° 


and an effective minimum exists for this very special case. 

General Case; We will now return to the general case, and clearly 
show the different roles played by H and H 2 , by computing separately 

h 2 ) and d^ Ad( V V- 


These derivatives may be written directly: 
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3^ AJ 2 ( V V 


-f e FT E (H. ) e F ' T dt A' 1 M 


(4.6.4) 


3hJ AJ 2 (H 1' V 


"t Ft _ f 
e dt 


Ft F 1 t 'v 

e fT E (H ) e* dr A‘ M 
o 1 12 


(4.6.5) 


where : 


r - *22 - H 1 *12 - H 2 *12 


VV * H 1 E 11 H i + £ 22 - H 1 S !2 - =12 H i 


As expected, we see that there is no way to set - AJ (H , H ) to be 

d 2 1 2 

equal to zero* This derivative is always negative, and as a consequence 

AJ^ (H^ , ) is a monotonic decreasing function of • 

On the other hand, assume that a particular has been chosen* Then 

from (4*6*4), we see that for this particular , there exists a unique H* 

for which AJ 2 (H , H ) - 0. 

1 H 1 

This suggests clearly that for any arbitrary H , say H , there exists 

2 2a 

an "optimal" H* such thatVii ? H* we have: 
la 1 la 


4J 2 (H L' H 2„> i 4J 2 (H l’ “2a 1 


This "partially minimizing*' H* is easily computed by defining A, 
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A - H A r so that we have immediately: 
2 2 2a 12 


H* = (W A 1 + S- ) 

la a 12 12 11 


( 4 * 6 . 6 ) 


where W satisfies: 
a 


W , *22a + A 22a “a + J 22 ' '“a *i 2 + 4 2 > hi '"a *i 2 + E i 2 > ’ ’ ° 

(4.6.7) 

The important fact to remember is that when i s singular , but not 

identically zero, not any arbitrary choice of a "large" H will lead to a 

small cost. The procedure is thus to fix on the basis of engineering 

judgment and then choose H*^ solution of (4.6.6, 7). 

We will soon give a way to choose a reasonable H.^, kut f^rst we w m 

show on a three-dimensional sketch, how AJ^H^, H 2 > behaves as a function 

of H and H . See Figure 7, page 77. 

-L ^ 

From the sketch, it is clear that in any hyperplane of constant t^, 

AJ (H , H ) is a monotone decreasing function of H . On the contrary in 
2 12 z 

any hyperplane of constant H , Aj 2 (H , H 2 > possesses a single minimum for 

H = H* 

1 la 

Proposed Method for a Choice of 

We propose now a possible way of choosing H^. ^ ur P ro ^ em comes 
from the fact that Z^ is singular: the idea is to consider a fake Z^ 

which is no longer singular. In other words, instead of: 


0 


0 
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take the following fake initial covariance matrix 


where we can adjust a to obtain a desirable optimal response using the first 

2 2 - 

design (E non-singular ) . An initial guess for a could be a - norm E , 

.L-L 11 

corresponding to saying that the fake standard deviation of x, „ has approxi- 

r - 10 r -n 


mately the same size as the one of x^. (Recall that x^ = 
and that we want to give a fake covariance matrix to x^. ) 


- iu r " 

x io"| x io 

L x ioJ L° . 


Now will lead to a unique H* through formulas (4*2*10, 11)* 

Write now H* = [H*, H*] and pick H = H*. Using formulas (4*6.6, 7), it 
is easy to compute the value of H*^ for the real design. 

This method is not the unique one, and the designer has entire freedom 
to choose so as to satisfy certain desired properties of the closed loop 
response of the system* The important fact to remember here, is that, once 
has been chosen, then is uniquely determined by ( 4 . 6 . 6 , 7 ). 

Example . We will now show by an example the procedure to follow. 

Consider the following completely controllable, completely observable, 
non -output stabilizable , time -invariant plant: 


x- l (t) 
x 2 (t) 
x 3 (t) 


1 X^t) 

o x 2 (t) 

0 X 3 (t) 


0 u (t ) 


y 1 (t) 
y 2 (t). 


x i (t) 
x 2 (t) 

x 3 (t) 
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1. Assume first that we have access to the complete state; then a 
cost (3.1.4) with: 

36 0 0 

Q = I 0 13 0 | and r = 1 


0 

0 


13 

0 


0 

14 


leads to the optimal control: 


u*(t)=[-6 - 5 -6] 


x 1 (t) 
x 2 (t) 

x 3 (t) 


and the optimal closed loop system 




- 



4 l<t) 


Oil 


X^t) 

x 2 (t) 

= 

0 0 1 


x 2 (t) 

X3 (t) 


-6 -5 -6 


. X 3 (t) . 


corresponding to the stable modes = 

if we assume E {x } = 0 and 
o 


- 3, s = 


- 2; s - - 1. Moreover, 


l = 
o 


0 

0 

0 


then the optimal cost is given by: 


J* = tr(K I ) =42 
1 o 


where JI is given by; 
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n = 


30 6 6 

6 18 5 

6 5 6 


2. Now in practice, we don't have access to x^(t), so that we will 
design an observer to estimate x^ft), and then apply the control: 


u(t) = [ - 6 - 5 -6] 


(t) 
x 2 (t) 
x 3 ct) 


Now it is immediately seen that is singular, 

will then take a fake Z^ to pick the value of h^. 


1 0 
0 0 


Consider Z 


Ilf 


1 0 
0 1 


(i* e. , a = 1) 


We 


Under these conditions, using formulas (4.2.10, 11) we obtain W = + 1, 

H* = [ 1 1 ] . We will then pick h^ - + 1- Putting this value of h^ in 

(4.6.6, 7) leads immediately to h* = i/J" - 1 . So that we will then imple- 

1 a 


ment a compensator with 


h = [/aT - i, i] 

For this design 

F = 0 - [/T - 1 

so that we will have e(t) = - J2 e(t). 

Remark. Note that for < ^y nam i cs of the error would have 
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been e(t) = - 2 e(t). The actual compensator is slower. This is exactly 
what we expected since/ in fact/ the real initial state covariance matrix 
is such that the initial expected error is smaller in the real design. The 
observer expends then less work in recovering from the initial error, and 
"slows down" its speed. 

As a verification of our results, we will try to minimize directly 

A 

(hi , h^) for this simple example. 

A ^2 (h l' = *22* h l' E o^ h l } 


From (4.6.1), we have 


e <h_> = h t + 2 

o 1 1 


and noting that F = - (h^ + h^) we have 


X 22 (h l' V = 


18 


h l + h 2 


(from 4.2.15 with M = 36) 


so that we obtain 


h i + 2 

A VV h 2 > = 18 


3 A . . . 18 
AOh (h n h ) = 


(It + 2h, h - 2) 


3h l "2' -x “1-2 
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18 




As expected , AJ (h , h ) is a monotonic decreasing function of h , and 
2 r 2 2 

a choice of h. =1 leads to h* = /3 - 1* 

2 a la 

The increase in the cost for our design, (H = [/3 -1,1] is then: 


AJ* - 36 (/3 - 1) 

Note that this increase could be made arbitrarily small by choosing 

/ 2 

h^ greater and greater and choosing accordingly h*^ = v2 + h^- h^. It 

can easily be seen that under these conditions, AJ* (h* , h ) = 

2 la 2a 

36 (Jl + h^ - h_ ) is a monotonic decreasing function of h . (Recall 
2a 2a 2a 

that h must be positive here, for stability of F) . 

Remarks The generalizations to handle particular initial conditions 
are straightforward. If E q (H) turns out to be a quadratic function of H, 
whose quadratic term possesses a positive definite coefficient matrix, then 
a unique minimizing H* exists, and the optimal compensator follows, with 


specified dynamics. 
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CHAPTER V 

THE CLASS OF OPTIMAL COMPENSATORS 


5.1 Introduction 

We derived in Chapters III and IV the solution for the optimal control 
of an n-th order plant having n-m inaccessible states, where the control 
is to be generated from a compensator of order n-m, and where the cost to 
be minimized is: 


. J 


{x' (t) Q x (t) + u' (t) R u <t) } dt 


I 


(5.1.1) 


We showed that the equivalent of the separation property holds for this 
problem, namely that the optimal compensator can be decomposed in two 
totally independent parts: a minimal -order Luenberger observer with 

optimally specified dynamics, and a controller where we used the optimal 
gains for the linear regulator problem. 

Now the uniqueness of the solution was mainly due to the fact that* we 
used a canonical form for the output matrix of the plant, namely: 


C = [I 

mxm 


° , J 

mx (n-m) 


(5.1.2) 


We saw in Chapter II how to go from any arbitrary n-th order plant with 
output matrix C of rank m. 



to an n-th order canonical plant: 


x (t ) = A x (t ) + B u(t); y(t) = C x (t) (5.1.4) 

where C is given by (5.1.2). We also showed how to transform an optimiza- 
tion problem for the initial plant into an equivalent optimization problem 
for the canonical plant. It is interesting now to return to the initial 
plant and see what happens to the optimal compensator* 

Recalling that the similarity transformation P to go from (5*1*3) to 
(5.1.4) is not unique, it is expected that the optimal compensator for the 
initial plant will not be unique. However, we will define precisely the 
class of all optimal compensators. 

We will now show precisely how to go back to the initial plant, then 
prove that the total cost for the overall canonical plant is the same as 
the total cost for the overall optimal initial plant, whatever P matrix we 
use to go from one to another. We will then define the class of all 
optimal compensators for the initial plant, and finally provide a numerical 


example. 



so 


5.2 Equality of the Total Costs 

Recall from Chapter II that we go from (5.1-3) to (5.1.4) by using the 
similarity transformation: 

x(t) - P x(t) (5.2.1) 


where we have defined: 



such that C given by (5.1.2) becomes: 


C 


C P 


-1 


(5.2.2) 


(5.2.3) 


Recall that the nonuniqueness of P is due to the fact that T is arbitrary 
as long as its row vectors span the orthogonal complement of the range of 
C. Recall also the relations between the two systems: 

A — P A P X ; B = P B; C = C P _1 (5.2.4) 

Q = (P') _1 Q P _1 ; R = R? K* = K* P _1 (5.2.5) 


In the following , any symbol with an overbar will be related to the initial 
given plant, any other symbol without this overbar will be related to a 
particular canonical plant associated with a given P transformation. 

The overall optimal canonical system is given by (3.3.4) and (3.3.5) 
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that is: 


£(t) = r* £(t) 


( 5 . 2 . 6 ) 


where we have defined: 


r* 


A + BK* - BK* N 

0 F* 


( 5 . 2 . 7 ) 


for which F = ~ H * A ^2 9 overns the dynamics of the error e (t) = (t) 

x 2 (t). 

Define now the transformation: 


P = 


P 

0 


0 

I 


p- 1 


then if £ (t) 


x(t) 
e (t) 


we see that 



K (t) = P 1 Ut) 


( 5 . 2 . 8 ) 


( 5 . 2 . 9 ) 


so that we can easily come back to the initial overall optimal system asso 
dated with the particular P matrix used* Namely, we have: 


Ht) = r* 5(t) 


( 5 . 2 . 10 ) 


where T* is given by: 
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r* = p r* P = 


a + BK* 
0 


- BK* P -1 N 


F* 


A simple look at this matrix shows that there will be as many optimal 
compensators as there are P’s. 

Now the total cost associated with (5-2. ) is given by: 


J* = tr (A* H*) (5.2.11) 

2 o 

where A is given by (3.3.24) and ^ by (3.3.17) all quantities being computed 
for the optimal H, 

On the other hand, the total cost associated with (5.2.10) is given by: 

J* = tr (A* E* ) (5.2.12) 

Z o 

where A* is the solution of: 


a r* + r*' a + ft = o 


(5.2.13) 


where ft is given by: 


Q + K* ’ R K* 

- N ' (P 1 ) _1 K*' R K* 


- K* 1 R K* P -1 N 
K*' R K* 


(5.2.14) 


It is a simple matter now, using formulas (5,2.8, 9) and the different 
relations between the two systems, to see that: 
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5 = P 5 ?•; Q = P’ P (5.2.15) 

o o 

so that from (5.2.13) follows: 

A = P* A P (5.2.16) 

Now with these formulas, a very simple calculation, similar to the one we 
did in Chapter II, yields: 

J* = tr(A* S*) = tr (A* §*) = J* 

2 o o 2 

In other words, we simply showed that whatever similarity transformation 
we used to go from the initial plant to the canonical plant, the total costs 
and consequently the increases in the costs (as we already proved (Chapter II) 
that the constant parts of the costs were equal), are the same. We proved 
here a first desirable property of the design; although for an arbitrary 
plant there is a class of optimal compensators rather than a unique one/ 
all the compensators in the class lead to the same increase in the cost. 

We will return now to the variables x(t) and z (t) , and give precisely the 
compensator we would implement around the initial plant. 
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5 * 3 Optimal Compensator. Initial Plant 

We will return now to the practical variables x(t) and z (t) , as we 
will finally implement z(t) and not e(t). 

From (3.2.3), recall that the optimal minimal-order Luenberger observer 
is given by: 

z(t) = F* z(t) + G* x x (t) + D* u (t ) (5.3.1) 

x^(t) - z(t) + H* x i (t) (5.3.2) 

where we have defined 


F* 


= A 


22 


H*A 


12 


G* = F* H* + (A - H*A ) 
^ XX 


D* = B 2 - H*B^ 


Now the implemented control is; 


(5-3.3) 


U (t ) - KJ X 1 (t) + K* x 2 (t) = (KJ + K* H*) x^t) + K* z(t) (5.3.4) 

So that we can use u(t) given by (5.3.4) in (5.3.1), to yield the "compact" 
form of the compensator: 


z(t) = F* z(t) + G* x^(t) 


(5.3.5) 



85 


where we have : 

(5.3.6) 

x(t) 

for all designs whatever P is, and that 
x 2 (t) = T x(t) will depend on the choice of T. 

It is then intuitively clear in view of (5.3.4), that x^ (t) being the 
same for all designs, the quantity K* + K* H* should be an invariant, (i e. , 
independent of the P transformation), although K* = K* P 1 and as such K*, 

K* depend on P, and H also. We will clearly demonstrate this result in 
Section 5.4. 

Let us now proceed a step further. Define as we did in Chapter III, 
Section 3.2: 



then the overall optimal canonical plant is 
i (t) = C (t) 


F* = F* + D* K* 


G* = G* + D* (K* + K* H*) 


Now recall that: 


x(t) = 


"x^t) 


c 

_ x 2 (t) _ 


T 


so that x^(t) = C x(t) is the same 


where we have from (3.2.15): 
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= 


A + B(K* + K* H*) C BK* 

G* C F* 


Using P defined in (5.2.8), consider 


?(t) = P' 


i T x<t) l 

c ( t> = |_ 

[z (t)J 


we can then return to the initial given plant. Namely, we have 


t(t) = C(t) 


(5.3.7) 


where <f* satisfies: 


,-l 


$* = P A p = 


A + B(K* + K* H*) C 


G* C 


B K* 

2 


F* 


(5.3.8) 


We will then implement in practice the following compensator around the 
given initial plant : 


z (t) = F* z (t) + G* y (t) 


(5.3.9) 


u (t) = (K* + K* H*) y (t) + K* z (t) (5.3.10) 

A A 

Note that F* , G* K* depend on the particular similarity transformation P 
we have taken to go from the initial plant to the canonical plant and back. 
Now we have already suggested that (K* + K* H* ) should be an invariant: 
in view of (5,3.10), it must be so and we will prove it soon. Another 
desirable property should be that F* have the same dynamics whatever P 


we choose . 



Define now: 
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Yls) = 

L[y(t) ] 


Z (s) = 

L[z (t) ] 

(5.3.11) 

U (s) = 

L[u(t> ] 



where L denotes the Laplace transform? then the overall transfer function 
of the compensator is: 

y (s) = -y S i = K* (si - F*) _1 G* + (K* + K* H*) (5.3.12) 

Y (s) Z L z 

Clearly this transfer function should also be independent of the P 
matrix we have used to go to the canonical plant and back- (We insist that 
all the matrices in (5.3.12) depend on P) . 

We will now prove these desirable properties rigorously, and precisely 
determine the class of all optimal compensators for a given arbitrary plant 
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5.4 The Class of Optimal Compensators for a Given Plant 

Recall that the P matrix to go from (5.1.3) to (5.1.4) is given by: 


where C is the mxn output matrix of full rank m of the initial plant , and 
T is an arbitrary (n-m)xn matrix such that P is non-singular. 

Partition now the inverse of P, namely: 


(5.4.1) 



a* , 
T ] 


(5.4.2) 


a* 

where C is nxm and T is nx (n-m) . 

C and T are respectively of full rank m and n-m as P is obviously non- 
singular, they also both depend on the particular arbitrary T we have chosen 


in (5.4.1) . 

Now from the condition P ^ P = I we obtain: 

n 


C C + T T = I (5.4.3) 

n 


and from the condition P P 


I , we have also: 
n 


- a. 
C C 


'V 

I ; T T 


m 


I 

n-m 


- 'll 

C T “ 


0 


rax (n-m) ' 


T C 


0 


(n-m)xm 


(5.4.4) 
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It is now a simple matter to compute explicitly the A and B matrices for 
a canonical plant, as a function of A, B and the P transformation. Namely, 
we have: 


- - 

C A C 

CAT 


*n 

*12 ■ 

- a 
TAG 

- ^ 
TAT 


. A21 

*22 


“ “ 


~ - 

C B 


B i 

T B 


B 2 

_ - 


— 


(5.4.5) 


(5.4.5bis) 


the A ‘s and B.’s we have used in Chapters III and IV are then readily 
ij x 

identified. It is clearly seen that there are as many canonical plants 
as there are P matrices. However, what we proved is that for each of 
them the optimal compensator was unique . 

Similarly, we have for the initial covariance matrix: 



r -- 1 


r. _ "1 


CSC 

C E T' 


L n 

L 12 

M 

11 

hj 

M 

W 

II 

i-3 

Ml 

O! 

T E T' 

= 

_ Z 12 

Z 22 


The E . . ' s we have used in Chapters III and IV are also readily identified. 
ID 

Note that E = C E C' is the same for all transformations P, and corres- 
ponds to the initial output covariance matrix of the given initial plant . 

What we will do now is simply reconsider the equations for the unique 
optimal compensator, associated with a particular canonical plant, and 
use in these equations the values of A^ and E^ given by (5.4.5) and 
(5.4.6) . 

We will only consider here the case where the initial output covariance 
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matrix Z = C Z C' , is non-singular, 

1, Equation for W. From (4,2.13), we have that W is the unique posi- 
tive definite solution of: 


22 


W + 


WA ' 
22 


+ Z 


22 


- <WA i2 


E' ) 
12 1 


,-l 

11 


(WA 


12 


E‘ ) ' = 
12 


Using (5.4.5) and (5.4.6), we obtain: 


T A T W + W T'A’T' + T E T - (W T'A'C' + T E C') (C E C ' ) _1 

(W T'A'C’ + T Z C')' = 0 (5.4.7) 

Define now 

W = T W T’ (5.4.8) 

t i 'V 

Note that W is nxn. A left multiplication of (5.4.7) by T and a 

'll 

right multiplication by T' yields, after use of (5.4.3), the following 
equation for W 

A 0/ + W A' + Z - (W A’ + E) C' (C E C’) -1 C (W A' + E) ' = 0 (5.4.9) 


Note that this equation involves only quantities related to the initial 
given plant. As a conclusion, W does not depend whatsoever on P. This is 
our first invariant, and W really characterizes the given plant (A, B, C) . 
All the W’s for the different canonical plants will then be related to 
this 0/ by (5.4.8) . 
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According to the fact that we assumed the pair (A, C) to be completely 
controllable, and also because C £ C' is positive definite, it can be shown 
that (5.4.9) has always at least one positive definite solution W. 

2. Equation for H* . We will now considers 


H* 


(W A 


12 


1 1 ) 
12 * 


.-1 

'll 


(5.4.10) 


Using (5.4.5) and (5.4.6), we have: 


H* 


(W T'A'C' + 


TEC'Hcic') 1 


(5.4.11) 


Consider now T H* , namely, we have 


T 


H* — (T W 


'V — — 

T’A'C 1 


T 


fc'Mcic’l 1 


Using (5.4.8) and (5.4.3), it comes immediately: 
T H* = (W A ' C ' + I C') (C f C ’) _1 - C 


Define now 


H* = (III A 1 + E) C' (CZC') 


(5.4.12) 


(5.4.13) 


we see immediately that H* (an nxm matrix) is a second invariant as it 
involves only terms related to the given initial plant. 
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All the different H* for different canonical plants will then be 
related by 

T H* = H* - C (5.4.14) 

We can now proceed and express all the matrices for the optimal com- 

r\j 

pensator in terms of W, H* and T. We will then have precisely determined 
the class of all optimal compensators. 

3. Equation for F* . Consider now F* = - HA*^ , which governs the 

dynamics of the error. We have from (5.4.5) and (5.4.6) 

F* = T A T - H* C A T (5.4.15) 

a* 

A left multiplication by T and the use of (5.4.14) and (5.4.3) yields 

T F* = (A - H* C A) T (5.4.16) 

Define now a new nxn invariant matrix: 

F* = A - H* C A = (I - H* C) A (5.4.17) 

n 

■ ^ _ 

then recalling that T has full rank (n-m) , we can make use of the pseudo- 
inverse matrix (see Zadeh and Desoer [28]) to write: 


F* = T F* T 


(5.4.18) 
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where we have defined 


'V f* m ^\j *\j —1 'X* 

T = (T* T) T' 


{5.4.19) 


and where F* is given by (5.4.17). 

We have then precisely defined the structure of all the F* matrices 

a, 

as a function of the invariant F* and the transformation T. Namely, given 

f\j 

P, T is automatically fixed and F* follows. 

— nj 

Now recall that C and T are respectively of full rank m and n-m. 

From (5.4*4), we have C T = 0 which implies that R(T) is contained in N(C). 
Since the null space of C has dimension n-m, we see that the range space 

f\, ■'Vj 

of T spans it exactly, whatever P matrix we choose (i.e. , whatever T) . 

‘^*4* . 

We will then say that the dynamics of F* = T r* T are those dynamics of 
F* which are associated with the null space of C, and as such will be the 
same whatever T we have. For more details on this concept, see [27], and 
also [29], Appendix II, about pseudo -inverses. 

We then have the desirable expected property that all optimal compen- 
sators in the class will have the same dynamic s- 

4* Equation for F* = F* + D* K^ . Recall from (5.4.5bis) that = 

- - - -l - ^ 

C B and B^ ~ T B; recall also that K* = K* P leading to K* = K* C and K* 

— 

K* * T ; we then obtain: 


D* K* = (B 2 - H* B x ) K* 


D* K* — (T B - H* C B) 


- n. 

K* T 


(5.4.20) 
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% 

A left multiplication of (5.4.20) by T and the use of (5.4.3, 14) yields 
T D* K* = (B - H* C B) K* T (5.4.21) 

Define a fourth nxr invariant matrix: 


V* = B - H* C B = (I - H* C) B (5.4.22) 

n 

Then using the pseudo -inverse of T, we have in passing: 

D* = (5.4.23) 


and for our present interest: 


D* K* = T V* K* T 


Note that K* is directly related to the initial given plant. We have now 


F* = T + (F* + V* K*) T 


(5.4.24) 


so that we can define 


F* = F* + V* K* 


(5.4.25) 


or using (5.4.17) for F* 


(5.4.22) for V* 
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A 

F* = (I - H* C) (A + BK*) (5.4.26) 

n 

(Recall that A + BK* governs the dynamics of the optimal given plant assuming 
complete feedback. ) 

We can give now the general relation between all the matrices F* 

* Z ''o 

F* = T F* T 


Mote that, as mentioned above for F*, the dynamics of F* are those dynamics 
of F* associated with N(C) and do not depend on T. 

5. Equation for K * + H* 

It was intuitively clear from (5.3.10) that this quantity should be 
invariant. We can now prove it very simply. 

K* + K* H* = K* C + K* T H* (5.4.27) 

Now from (5.4.14): T H* = H* - C so that (5.4.27) becomes: 

K* + K* H* = K* C + K* H* - K* C 

-L ^ 

that is, we have the fifth rxm invariant matrix: 


K* + K* H* = K* H* 


(5.4.18) 
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6. Equation for G* - G* + D* (K * + H*) . It remains only now to 

A 

determine the value of G* as a function of some other expected invariant. 
We will first evaluate 


G* = F* H* + <A 21 - h * a i:l > 


(5.4.29) 


Note first that: 

A - H*A 11 = TAC-H*CA$ 

*\y 4- 

so that a left multiplication by T and the use of the pseudo-inverse T 
yields: 

A 21 “ H * A 11 = T + F* C (5.4.30) 

where F* is given by (5.4.17). 

It is easy to see now that 

'Yif r- r 0+ _ 

G* = T F* T H* + T F* C 


so that using (5.4.14) 


G* 


T + F* H* = T^G* 


(5.4.31) 


where we have defined the new nxm invariant matrix 
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G* = F* H* = (I - H* c) a H* 

n 


It is now a simple matter to see that 

G* = T + (3* + T + V* K* H*, 


namely we have 
G* = T + G* 


where we have defined the nxm matrix: 
G* = G* + V* K* H* 


or equivalently in view of (5.4.36) 
G* = F* U* + V* K* H* 


(5.4.32) 


(5.4.33) 


(5.4.34) 


(5.4.35) 


which can be rewritten using (5.4.22) and (5.4.26) 

G* = (I - H* C) (A + BK*) H* = F* H* 
n 

We insist that all matrices denoted by script letters are directly related 
to the initial given plant, as well as those which have overbars. We have 


then given a complete picture of the structure of the gains for the class 
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of all optimal compensators. 

We will conclude by showing that as we expected the overall transfer 
function for all compensators in the class, is indeed the same. Namely, 
(5.3.12) becomes: 


— r o+ * ^ — 1 'vh ~ — 

f (s) = K* T (si - T F* T) T G* + K* H* 
n-m 


(5.4.36) 


'Vf 'Xtf Or 

from the fact that T T “ I = T I T, we can rewrite (5.4.36) as 

n-m n 


V(s) = K* T[T + (sl n - F*) T ]" 1 T f G* + K* H* 


(5.4.37) 


It can be shown ([27] * [29]) that ${s) defined as: 

■Ms) = T[T + (sl n - F*) T] -1 T + 

is independent of T, and corresponds as we previously mentioned to those 
dynamics of F* associated with the null space of C. We then have 


S' (s) = K* $ (s) G* + K* f/* 


(5.4.38) 


This last formula is certainly the most important result in this thesis. 

We have an explicit formula for the optimal compensator transfer function, 
involving uniquely quantities related directly to the given arbitrary plant . 
This result as well as the entire Section 5.4 is original. 

The results in [7] duplicate only one special case of Chapter IV, namely 
when E {x^} = 0, z(Q) = 0; no explicit general solution is given when the 
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plant is not in the canonical form assumed in Chapter IV. We have then 
rederived the partial results of [7] by a direct method, and more important, 
completely solved the problem of finding the optimal compensator based on 
a minimal order Luenberger observer, for any arbitrary given plant. 

A numerical example will now be provided. 
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5 . 5 An Example 

Consider the following non-canonical , completely controllable, 
completely observable, initial plant: 


~ 


- 


■*_ 



x 1 (t) 


0 1 


x 1 (t) 


0 






+ 


_x 2 (t)_ 


0 -2 


x 2 


1 


u(t) 


(5.5.1) 


y (t) = [l l] 


x x (t) 
x 2 (t> 


(5.5.2) 


If we assume complete state feedback and take a cost given by (2.1.3), 


where : 


Q = 


4 

0 


0 

1 


and r = 1 


(5.5.3) 


then the optimal control is given by 


u*(t) = K* x (t ) = [-2 -1] 


x 1 (t) 

i 2 a) 


(5.5.4) 


Defining 


E (x x'} = l = 
o O 0 


1 1 
1 2 


(5.5.5) 


the optimal cost is in this case: 


J* = tr (JI E ) = 12 
1 o 


(5.5.6) 
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where H is given by: 


n - 


6 

2 


2 

1 


( 5 . 5 . 7 ) 


Now, in practice, we don’t have access to the complete state, so that 
we want to design an optimal compensator for this plant. 

A. Consider now a first similarity transformation P^, 


P 1 = 


1 

0 


1 

1 




1 

0 


-1 

1 


( 5 . 5 . 8 ) 


such that the initial plant now take the following canonical form: 


*1 (t) 


0 

-1 


(t) 

+ 

1 

Vt>_ 


0 

- 

-2 


_x 2 (t) _ 


1 


u (t) 


( 5 . 5 . 9 ) 


y (t) = [1 0] 


(t) 

x 2 (t) 


z = 

o 


5 3 

3 2 


and K* = K* P 1 - [-2 1] 


( 5 . 5 . 10 ) 


( 5 . 5 . 11 ) 


It is immediately seen using Chapter IV that the optimal compensator 
for this canonical plant is uniquely determined by h* = 2 - / 2 , leading to 

± (t) = - /2 z (t) + (2 - 2 /if) x 1 (t) + (S2 - 1) u (t) (5.5.12) 


..’here u(t) is given by: 
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u (t) - - ^2 x^(t) + z(t) 


( 5 . 5 . 13 ) 


so that the compact equation for z(t) is 


z(t) = - z(t) - /2 x^(t) 


( 5 . 5 . 14 ) 


Now the overall canonical system will be: 


‘V tr 


- _ /2 -1 1 ' 


" x i (t) " 


= 

- /2 -2 1 


x 2 (t) 

_z(t) _ 


1 

^1 

0 

1 

M 

1 


_z (t) _ 


so that we can return to the initial plant by defining: 



" 1 1 0 - 


- 1 -1 0 " 

I—* 

II 

0 10 


0 10 


_ 0 0 1 _ 


0 0 1 


We then obtain: 


( 5 . 5 . 16 ) 


-Xi<tf 


* 0 

1 0 - 


"x^tr 

x 2 (t) 

= 

- /I 

-2-/2 1 


x 2 (t) 

. 2 (t) _ 


& 

1 

iH 

1 

1 


_z(t) 


( 5 . 5 . 17 ) 


leading to the practical implementation for the initial plant: 


z(t) = - z (t ) - /2 y(t) 


( 5 . 5 . 18 ) 
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u(t) = - /2 y{t) + z (t) (5.5.19) 


whose transfer function is: 

V (s) = Si£l = _ /2 - — (5.5.20) 

Y(s) s + 1 

It can easily be shown that, as expected; 

A 

AJ 2 * = AJ* = 5 /F - 7 = 71.10 (5.5.21) 


B. Consider now a second similarity transformation P 2 : 


r- n 


r 1 

1 

1 

p- 1 - 

0.5 

0.5 

1 

-1 

2 

0.5 

-0.5 

L J 


L J 


(5.5.22) 


such that the initial plant now takes the following different canonical 
form: 




_ 


- " 


“ ” 

x 1 (t) 


- 0.5 0.5 


x^ (t) 

+ 

1 

x 2 (t) 


1.5 -1.5 


_x 2 (t)_ 


-1 


y (t) - [1 


0] 


x 1 (t) 

x 2 (t) 


E = 
o 


5 

-1 


-1 

1 


K* = K* P 1 = [ - 1.5 -0.5] 
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We will then implement the optimal compensator uniquely determined by 
h* = - 3 + 2 /2, leading to 

z (t) = - /2 (t) + 4 {/2 - 1) x^ (t) + 2 (1 - Jl) u (t) 

u (t ) - - /2 x^(t) - 0.5 z(t) 

so that the compact equation for z(t) is: 

z(t) = - z(t) + 2 /2 X x (t) (5.5.26) 

The overall canonical system will now be: 

-0.5 - /J 0.5 -0.5 “I r* <tr. 

1.5 + /l -1.5 0.5 x 2 (t) (5.5.27) 

2/2 0 -1 z (t) 



(5.5.24) 

(5.5.25) 
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leading to the practical implementation for the initial plant: 


z(t) = - z (t) + 2 1/2 y(t) (5,5,30) 

u(t) = - /2 y (t) - 0.5 z(t) (5,5.31) 


whose transfer function is: 


Y 2 (s> 


= - /2 - 

Y(s) 


/? 

s + 1 


(5.5.32) 


It is immediately seen that the increase in cost due to the implementa- 
tion of this compensator is still given by (5.5.21). 

It is clearly seen that 

1. ¥ x (s) = ^(s) See (5 ‘ 5 * 20) and (5*5.32) 

2 . (k* + k* h*> = (k* + k* h*) 2 = - S2 

as expected . 

We will now make use of Section 5.4 and compute the invariant quan- 
tities for the initial non-canonical given plant. 

From (5.4.9) , we have 


" W 7 + /50 ■ 

_ 7 + /SO -7 + /SO _ 


(5.5.33) 


where for positive definiteness must be greater than (7 + /50) 3 . 
Now it follows that: 



(5.5.34) 
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and from (5.4.17) that 


F* - 


[• -* i 

Lo - g J 


(5.5.35) 


from (5.4.22) we have also 


V* = 


[ l + *l 

L-i + a J 


(5.5.36) 


So that from (5.5.4), (5.5.35), (5.5.36) and (5.4.25) 


F* 


•[ 


-2 -2 /2 
2 -2 SI 


-1 -2 /2 I 

1 -2 /2 J 


(5.4.37) 


Now, we have from (5.4.32) 


■ 2 - 2 /2 ' 

G* = 

_ 2 -2 v'T 


So that from (5.4.35), we obtain 


G* 




p ./T ^,/T 


./O j . 1 ) 

■3 v'A. T j./ 

3 /2 (/J- 1) 


I 

J 


Noting that K* = [-2 -1] and K* H* = 3 v7, it follows that y (s) given 

by (5.4.38) is the same as y (s) given by (5.5.32) or (5.5*20), namely 


Y(s) = - /2 - 




S + 1 


. t -1 % 

It is also possible from the matrices P^ and P^ to pick the TVs i = 1, 2 
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and to verify that all the formulas in Section 5.5 hold* 

Important Remark . Note that from a practical computational standpoint, 
it is not advantageous to work with the script matrices/ related to the 
initial given plant. As an example, the Riccati-type equation for W is of 
order n, instead of n-m for W* 

Knowing that all the compensators will lead to the same optimal trans- 
fer function, it is then computationally more efficient to pick a P, go to 
a canonical plant, compute the canonical optimal compensator, and go back 
to the initial plant by using the P matrix defined in (5.2*8). 
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CHAPTER VI 
CONCLUSION 


We have revisited and solved in this thesis the problem of finding the 
optimal n-m order compensator based on a minimal-order Luenberger observer. 
For any given completely controllable and completely observable plant, we 
have shown that there exists a class of optimal compensators, specified by 
a unique optimal transfer function* Moreover, all compensators within this 
class lead to the same cost and the same overall dynamics for the augmented 
system plant + compensator: the optimal dynamics for complete state feed- 

back, combined with the optimal dynamics of the minimal order Luenberger 
observer* The interesting result was that a generalization of the separation 
property holds for this design, (i.e., design of the observer part of the 
compensator is completely independent of the controller part) . 

In case the plant is specified by a canonical output matrix (5*1.2), 
we have seen that the optimal compensator is unique and that its realization 
is specified by Equations (4.2.11), (4.2.13), (3.2.10), (3*2.11). 

Consider now the linear stochastic system, with perfect measurements: 


x(t) - A x(t) + 



v(t> 


0 


y (t) = c x (t) 


where x(t), A are partitioned as in (3.2.1) and where C has the canonical 
form (5.1.2). Assume that v(t) is a white noise process specified by: 
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E (v(t)} = 0 E {v(t) v'(t)} = 6 (t - t) 

for all t and t. It is interesting to note that Equations (4*2.13) and 
(4.2*11) are exactly the equations for the error covariance matrix and gains 
associated with the Bryson-Johansen filter we would implement to obtain an 
estimate of (t) from the perfect measurements y(t) = x^(t). This justifies 
completely the approach taken by Rom and Sarachik in [7]. Section 5.4 gives 
general equations for the case where the plant is not in the canonical form 
assumed in Section 4.2. Notice also that = C E C’ > 0 is equivalent to 

the condition that the Bryson-Johansen filter does not contain differentia- 
tors. For more details on the Bryson-Johansen filter, see [22] and also 
Jazwinski [30], example 7.14, page 228. 

We will now indicate a possible extension of the design philosophy we 
used towards compensators of lower dimension- 

Given the optimal control 

u*(t) = K* x ( t ) - K* x x (t) + K* x 2 <t) 

for the linear regulator problem (see (3.2.1) for the partition of x(t)) 
this thesis was concerned mainly with the reconstruction of (t) from 
y (t ) = x _ (t ) in order to implement the control 

u (t ) = K* x x (t) + K* x 2 (t) 
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Having only m available outputs, our compensator was of fixed order n-m. 
Now it might be very interesting and rewarding to consider the direct 
global estimation of K* x 2 (t). We see immediately that, as in this case 
we are trying to observe linear functionals of the state, the dimensions 
of the required observer may be far less than n-m. In particular, when 
u(t) is a scalar control, Luenberger [25] has shown that any linear func- 
tional of the state can be observed with an observer of order v - 1* Here 
v (see [25]} is the observability index of the plant, defined as the least 
positive integer such that the matrix: 

[C, A’ C', , (A* ) V “ 1 C'] 

has full rank n. Since for any completely observable system v-1 n-m, 
and for many systems v-1 is in fact far less than n-m, this approach could 
greatly reduce the order of the compensator. When u(t) is vector-valued , 
it is also reasonable to anticipate the existence of a lower-order compen- 
sator. The problem is then to use the approach taken in this thesis to 
find the optimal dynamics for this reduced order observer. We will recall 
the three steps of our approach: 

- Given an arbitrary plant, go to a canonical form by similarity 
transformation. (Chapter II) 

- Work out the problem for this new plant. (Chapters III, IV) 

- Go back to the initial plant. (Chapter V) 

We emphasize once again the importance of working with a canonical 
plant if one wants to understand clearly the internal structure of the 
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compensator (see the formulation by Nevffnan [5], [6] to appreciate the 
relevance of this remark) . 
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